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cDNA FOR HUMAN METHYLENETETRAHYDROFOLATE REDUCTASE AND 
~ USES THEREOF 

RELATED APPLICATION 

This appl^^tion is a continuation-in-part of US 
patent %)plicX^n serial number 09/258,928 filed March 
1, 1999, wKich is a continuation in part of US patent 
applic^^^ion serial number 08/738,000 filed February 12, 
l^^S-^both of which are still pending. 

BACKGROUND OF THE INVENTION 

(a) Field of the Invention 

The invention relates to a cDNA probe for human 
methylenetetrahydrofolate reductase (MTHFR) , and its 
uses . 

(b) Description of Prior Art 

Folic acid derivatives are coenzymes for several 
critical single-carbon transfer reactions, including 
reactions in the biosynthesis of purines, thymidylate and 
methionine- Methylenetetrahydrofolate reductase (MTHFR; 
EC 1.5.1.20) catalyses the NADPH- linked reduction of 
5, 10 -methylenetetrahydrofolate to 5- 

methyltetrahydrofolate, a co- substrate for methylation of 
homocysteine to methionine. The porcine liver enzyme, a 
f lavoprotein, has been purified to homogeneity; it is a 
homodimer of 77-kDa subunits. Partial proteolysis of the 
porcine peptide has revealed two spatially distinct 
domains: an N-terminal domain of 40 kDa and a C-terTuinal 
domain of 37 kDa. The latter domain contains the binding 
site for the allosteric regulator S-adenosylmethionine . 

Hereditary deficiency of MTHFR, an autosomal 
recessive disorder, is the most common inborn error of 
folic acid metabolism. A block in the production of 
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methyltetrahydrof olate leads to elevated homocysteine 
with low to normal levels of methionine. Patients with 
severe deficiencies of MTHFR (0 -20% activity in 
fibroblasts) can have variable phenotypes. Developmental 
5 delay, mental retardation, motor and gait abnormalities, 
peripheral neuropathy, seizures and psychiatric dis- 
turbances have been reported in this group, although at 
least one patient with severe MTHFR deficiency was 
asymptomatic- Pathologic changes in the severe form 

10 include the vascular changes that have been found in 

other conditions with elevated homocysteine, as well as 
reduced neurotransmitter and methionine levels in the 
CNS. A milder deficiency of MTHFR (35-50% activity) has 
been described in patients with coronary artery disease 

15 (see below) . Genetic heterogeneity is likely, 

considering the diverse clinical features, the variable 
levels of enzyme activity, and the differential heat 
inactivation profiles of the reductase in patients' 
cells . 

2 0 Coronary artery disease (CAD) accounts for 25% of 

deaths of Canadians. Cardiovascular risk factors (male 
sex, family history, smoking, hypertension, 
dyslipoproteinemia and diabetes) account for approxi- 
mately 60 to 70% of the ability to discriminate CAD 

25 patients from healthy subjects. Elevated plasma homo- 
cysteine has also been shown to be an independent risk 
factor for cardiovascular disease. 



acid that is formed by the demethylation of methionine. 
3 0 It is normally metabolized to cysteine (transsulfuration) 
or re-methylated to methionine. Inborn errors of 
metabolism (as in severe MTHFR deficiency) causing 
extreme elevations of homocysteine in plasma, with 



Homocysteine is a sulfhydryl -containing amino 
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homocystinuria, are associated with premature vascular 
disease and widespread arterial and venous thrombotic 
phenomena. Milder elevations of plasma homocysteine (as 
in mild MTHFR deficiency) have been associated with the 
development of peripheral vascular disease, 
cerebrovascular disease and premature CAD. 

Homocysteine remethylation to methionine requires 
the folic acid intermediate, 5-methyltetrahydrof olate, 
which is produced from 5 , 10-methylenetetrahydrofolate 
folate through the action of 5, 10- 
methylenetetrahydrofolate reductase (MTHFR) . Deficiency 
of MTHFR results in an inability to metabolize 
homocysteine to methionine; elevated plasma homocysteine 
and decreased methionine are the metabolic consequences 
15 of the block. Severe deficiencies of MTHFR (less than 
20% of activity of controls) as described above, are 
associated with early-onset neurologic symptoms (mental 
retardation, peripheral neuropathy, seizures, etc.) and 
with atherosclerotic changes and thromboembolism. Milder 
20 deficiencies of MTHFR (35-50% of activity of controls), 
with a thermolabile form of the enzyme, are seen in 
patients with cardiovascular disease without obvious 
neurologic abnormalities. 

In a survey of 212 patients with proven coronary 
25 artery disease, the thermolabile form of MTHFR was found 
in 17% of the CAD group and 5% of controls. In a 
subsequent report on 33 9 subjects who underwent coronary 
angiography, a correlation was found between thermolabile 
MTHFR and the degree of coronary artery stenosis. Again, 
traditional risk factors (age, sex, smoking, 
hypertension, etc.) were not significantly associated 
with thermolabile MTHFR. All the studies on MTHFR were 
performed by enzymatic assays of MTHFR in lymphocytes, 
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with measurements of activity before and after heat 
treatment to determine thermolability of the enzyme. 

Since 5-methyltetrahydrofolate, the product of 
the MTHFR reaction, is the primary form of circulatory 
folate, a deficiency in MTHFR might lead to other types 
of disorders. For example, periconceptual folate 
administration to women reduces the occurrence and 
recurrence of neural tube defects in their offspring. 
Neural tube defects are a group of developmental mal- 
formations (meningomyelocele, anencephaly, and 
encephalocele) that arise due to failure of closure of 
the neural tube. Elevated levels of plasma homocysteine 
have been reported in mothers of children with neural 
tube defects. The elevated plasma homocysteine could be 
due to a deficiency of MTHFR, as described above for car- 
diovascular disease. 

Neuroblastomas are tumors derived from neural 
crest cells. Many of these tumors have been reported to 
have deletions of human chromosome region lp36, the 
region of the genome to which MTHFR has been mapped. It 
is possible that MTHFR deletions/mutations are 
responsible for or contribute to the formation of this 
type of tumor. MTHFR abnormalities may also contribution 
to the formation of other types of tumors, such as 
colorectal tumors, since high dietary folate has been 
shown to be inversely associated with risk of colorectal 
carcinomas . 

MTHFR activity is required for homocysteine 
methylation to methionine. Methionine is necessary for 
the formation of S-adenosylmethionine, the primary methyl 
donor for methylation of DNA, proteins, lipids, 
neurotransmitters, etc. Abnormalities in MTHFR might 
lead to lower levels of methionine and S-adenosylme- 



thionine, as well as to elevated homocysteine. Disrup- 
tion of methylation processes could result in a wide 
variety of conditions, such as neoplasias, developmental 

anomalies, neurologic disorders, etc. 

Although the MTHFR gene in Escherichia coli 

irnetF) has been isolated and sequenced, molecular studies 
of the enzyme in higher organisms have been limited 
without the availability of an eukaryotic cDNA. 

It would be highly desirable to be provided with 
a cDNA probe for human methylenetetrahydrof olate 
reductase (MTHFR) . This probe would be used for iden- 
tification of sequence abnormalities in individuals with 
severe or mild MTHFR deficiency, including cardiovascular 
patients and patients with neurologic symptoms or tumors. 
The probe would also be used in gene therapy, isolation 
of the gene, and expression studies to produce the MTHFR 
protein. The probe would also provide the amino acid 
sequence of the human MTHFR protein, which would- be 
useful for therapy of MTHFR deficiency by biochemical or 
pharmacological approaches . 

It would be highly desirable to be provided with 
a molecular description of mutations in methyl- 
enetetrahydrof olate reductase deficiency. 

Patients with sequence abnormalities in MTHFR 
might have different responses to drugs, possibly but not 
limited to drugs that affect folate metabolism. 
Therefore, it would be useful to know if these mutations 
are present before determining the appropriate therapy. 
The drugs/diseases for which this might be relevant 
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include cancer chemotherapeutic agents, antibiotics, 
antiepileptic medication, antiarthritic medication, etc. 

SUMMARY OF THE INVENTION 

5 One aim of the present invention is to provide a 

cDNA probe for human methylenetetrahydrof olate reductase 
(MTHFR) . 

Another aim of the present invention is to pro- 
vide a molecular description of mutations in methyl - 
10 enetetrahydrof olate reductase deficiency. 

Another aim of the present invention is to pro- 
vide a nucleic acid and amino acid sequence for human 
methylenetetrahydrof olate reductase - 

Another aim of the present invention is to pro- 
15 vide potential therapy for individuals with methyl- 
enetetrahydrof olate reductase deficiency. 

Another aim of the present invention is to pro- 
vide a system for synthesis of MTHFR protein in vitro. 
A further aim of the present invention is to 
20 provide technology/protocol for identification of 
sequence changes in the MTHFR gene. 

In accordance with one aspect of the present 
invention, there is provided a cDNA probe for human 
methylenetetrahydrofolate reductase (MTHFR) gene encoded 
25 by a nucleotide sequence as set forth in SEQ ID N0:1 or 
having an amino acid sequence as set forth in SEQ ID 
NO: 2. The probe comprises a nucleotide sequence that 
hybridizes to the MTHFR nucleotide sequence, or an amino 



acid sequence that hybridizes to the MTHFR amino acid 
sequence . 

In accordance with another aspect of the present 
invention, there is provided a method of diagnosis of 
methylenetetrahydrof olate reductase (MTHFR) deficiency in 
a patient with MTHFR deficiency. The method comprises 
the steps of amplifying a DNA sample obtained from the 
patient or reverse- transcript ing a RNA sample obtained 
from the patient into a DNA and amplifying the DNA, and 
analyzing the amplified DNA to determine at least one 
sequence abnormality with respect to a human MTHFR 
encoded by a nucleotide sequence as set forth in SEQ ID 
N0:1 or having an amino acid sequence as set forth in SEQ 
ID N0:2, the sequence abnormality being indicative of 
MTHFR deficiency. 

The sequence abnormality may comprise a mutation 
selected from a group consisting of 167G/A, 482G/A, 
559C/T, 677G/T, 692C/T, 764C/T, 792+lG/A, 985C/T, 
1015C/T, 1081C/T, 1298A/C and 1317T/C. 

The selected mutation may consist of 677C/T. 
The MTHFR deficiency may be associated with a 
disease, disorder, or dsyfuntion selected from a group 
consisting of cardiovascular disorders, cancer, 
osteoporosis, increased risk of occurrence of a neural 
tube defect in an offspring of said patient, neurological 
disorders, disorders influenced by folic acid metabolism, 
metabolic or endocrine disease, inborn errors of 
metabolism, inflammation, immune disorders, psychiatric 
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illness, neoplastic disease and other related disorders, 
and renal disease. 

The cancer may be selected from a group 
consisting of neuroblastomas and colorectal carcinomas. 
5 The disorder may consist of osteoporosis. 

In accordance with yet another aspect of the 
present invention, there is provided a method for gene 
therapy of methylenetetrahydrof olate reductase (MTHFR) 
deficiency in a patient. The method comprises the steps 
10 of producing a recombinant vector for expression of MTHFR 
under the control of a suitable promoter, the MTHFR being 
encoded by a nucleotide sequence as set forth in SEQ ID 
N0:1 or having an amino acid sequence as set forth in SEQ 
ID N0:2, and .transfecting the patient with the vector for 
15 expression of MTHFR. 

In accordance with yet another aspect of the 
present invention, there is provided a human 
methylenetetrahydrofolate reductase (MTHFR) protein 
encoded by a nucleotide sequence as set forth in SEQ ID 
20 N0:1 or having an amino acid sequence as set forth in SEQ 
ID NO : 2 . 

In accordance with yet another aspect of the 
present invention, there is provided a recombinant human 
methylenetetrahydrofolate reductase (MTHFR) protein 
25 encoded by a nucleotide sequence as set forth in SEQ ID 

NO:l or having an amino acid sequence as set forth in SEQ 
ID NO: 2. 

In accordance with yet another aspect of the 
present invention, there is provided a method of 



treatment of MTHFR- deficiency in a patient that comprises 
administering such a MTHFR protein. 

The MTHFR deficiency may be associated with a 

cancer. 

The cancer may be selected from a group 
consisting of neuroblastomas and colorectal carcinomas. 

In accordance with yet another aspect of the 
present invention, there is provided a method of 
preventing an occurrence of a neural tube defect in an 
offspring of a patient. The method comprises 
administering to the patient such a MTHFR protein. 

In accordance with yet another aspect of the 
present invention, there is provided a method for 
determining drug susceptibility, drug response or drug 
toxicity of a patient having a methylenetetrahydrof olate 
reductase (MTHFR) deficiency. The method comprises the 
steps of amplifying a DNA sample obtained from the 
patient or reverse-transcripting a RNA sample obtained 
from the patient into a DNA and amplifying said DNA, 
analyzing the amplified DNA to determine a sequence 
abnormality in a MTHFR sequence, the MTHFR sequence being 
encoded by a nucleotide sequence as set forth in SEQ ID 
N0:1 or having an amino acid sequence as set forth in SEQ 
ID NO: 2, and administering the drug to the patient and 
determining the sequence abnormality associated with the 
patient susceptibility, response or toxicity to the drug. 
Further, in human therapeutic approaches, both 
pharmacokinetics and pharmacodynamics may be influenced 
by folic acid metabolism. Pharmacokinetics includes, but 



is not excluded to, factors such as absorption, 
distribution, metabolism, and excretion. 
Other considerations include safety and toxicity. 
Examples of toxicities from therapeutics include, but are 
not limited to, blood dyscrasias, cutaneous toxicities, 
systemic toxicity, CNS toxicity, hepatic toxicity, 
cardiovascular toxicity, pulmonary toxicity, and renal 
toxicity. 

The sequence abnormality may comprise a mutation 
selected from a group consisting of 167G/A, 482G/A, 
559C/T, 677C/T, 692C/T, 764C/T, 792+lG/A, 985C/T, 
1015C/T, 1081C/T, 1298A/C and 1317T/C and the drug may be 
selected from a group consisting of cancer 
chemotherapeutic agents, antibiotics, antiepileptic 
agents, antiarthritic agents, and ant i- inflammatory 
agents. Examples of chemotherapeutic agents include 
alkylating agents (nitrogen mustards, ethylenimines and 
methylmelamines, alkyl sulfonates, nitrosoureas, 
triazenes), antimetabolites (folic acid analogs, 
pyrimidine analogs, purine analogs and related 
inhibitors), and natural products (vinca alkyloids, 
epipodophyllotoxins, antibiotics, enzymes, biological 
response modifiers) . Examples of ant i- inflammatory 
agents include non-steroidal ant i- inflammatory agents, 
steroids, antihistaminergics, .5-LO inhibitors, cytokine 
agonists, and cytokine antagonists. The drug may also 
include therapies for metabolic or endocrine disease such 
as hormone agonists and antagonists, intracellular 
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response modifiers, and secretegogues . Other possible 
drugs include therapies for cardiovasular disease or 
disorders such as antianemic, antiangina, antiarrythmics, 
antihypertensives, positive inotropic agents, and 
antithrombotics. The drug may also be a therapy for an 
immune disorder, such as immune modulators, 
immunosuppression agents, and immunostimulation agents. 
Additionally, any other drug for the treatment of a 
metabolic disease, inborn errors of metabolism, 
psychiatric disorders, neoplastic disease and other 
related disorders, and renal disease is relevant to the 
invention 

The MTHFR deficiency may be associated with a 
disorder selected from a group consisting of 
cardiovascular disorders, coronary and arterial 
disorders, neurological disorders, increased risk of 
occurrence of a neural tube defect in an offspring, 
cancer, osteoporosis and other disorders influenced by 
folic acid metabolism. 

In accordance with yet another aspect of the 
present invention, there is provided a method of 
treatment of a patient having a cancer comprising the 
step of inhibiting gene expression for a MTHFR protein or 
a mRNA produced form the gene. 

In accordance with yet another aspect of the 
present invention, there is provided a method of 
treatment of a patient having a cancer comprising the 
step of inhibiting the MTHFR protein. 
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The use of MTHFR alleles as diagnostic, 
therapeutic, prognostic, and pharmacogenomic markers may 
also be applied to neurological disorders such as 
psychoses and to other diseases and disorders that may be 
treated with ant i -psychotic therapeutics. The presence 
of particular MTHFR alleles may be used to predict the 
safety and efficacy of a particular therapy and to select 
preferred therapies for these subjects. 

Accordingly, in one aspect, the invention provides a 
method of diagnosing a psychosis in a subject. This 
method involves analyzing the MTHFR nucleic acid in a 
sample obtained from the subject and determining the 
presence of at least one heterozygous MTHFR mutant allele 
in the subject that is indicative of the subject having 
the psychosis. 

In a related aspect, the invention features a method 
of determining a risk for a psychosis or a propensity for 
a psychosis in a subject. This method includes analyzing 
the MTHFR nucleic acid in a sample obtained from said 
subject and determining the presence of at least one 
heterozygous MTHFR mutant allele in the subject that is 
indicative of the risk for a psychosis or the propensity 
for a psychosis in the subject. 

In another aspect, the invention provides a method 
of diagnosing a psychosis in a subject. This method 
involves analyzing the MTHFR nucleic acid in a sample 
obtained from the subject and determining the presence of 
a heterozygous MTHFR mutation at position 677 and/or the 
presence of at least one other MTHFR mutation at a 
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position other than 677. These mutations are indicative 
of the subject having the psychosis. 

The invention also provides a related method of 
diagnosing a risk for a psychosis or a propensity for a 
psychosis in a subject. This method includes analyzing 
the MTHFR nucleic acid in a sample obtained from the 
subject and determining the presence of a heterozygous 
MTHFR mutation at position 677 and the presence of at 
least one other MTHFR mutation at a position other than 
677. These mutations are indicative of the risk for a 
psychosis or the propensity for a psychosis in the 
sub j ect . 

In addition to their use in diagnosing a psychosis 
or an increased risk for a psychosis, the determination 
of one or more mutations in an MTHFR allele may also be 
used to classify subjects into a subgroup for a clinical 
trial of an ant i -psychotic therapy, to determine a 
preferred anti -psychotic therapy for a subject, or to 
determine whether the MTHFR mutations are indicative of a 
response to an anti -psychotic therapy. 

Accordingly, the invention also provides a method 
for stratification of subjects involved in a clinical 
trial of an anti -psychotic therapy. This method involves 
analyzing the MTHFR nucleic acid in a sample obtained 
from a subject and determining before, during, or after 
the clinical trial the presence of at least one MTHFR 
mutant allele in the subject that places the subject into 
a subgroup for the clinical trial. In one preferred 
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embodiment, the ant i -psychotic therapy is a therapy for 
schizophrenia . 

In another aspect, the invention provides a method 
for selecting a therapy for a subject suffering from a 
psychosis. This method include analyzing the MTHFR 
nucleic acid in a sample obtained from the subject and 
determining the presence of at least one MTHFR mutant 
allele in the subject that is indicative of the safety or 
efficacy of the therapy. 

In yet another aspect, the invention features a 
method for determining whether a mutant MTHFR allele is 
indicative of a response to a therapy for a psychosis. 
This method includes (a) determining whether the response 
of a first subject or set of subjects at increased risk 
for or diagnosed with the psychosis differs from the 
response of a second subject or set of subjects at 
increased risk for or diagnosed with the psychosis, (b) 
analyzing the MTHFR nucleic acid in a sample obtained 
from the first and second subjects or sets of subjects, 
and (c) determining whether at least one MTHFR mutant 
allele differs between the first and second subjects or 
sets of subjects. If the MTHFR mutant allele is 
correlated to a response to the therapy, the mutant 
allele is determined to be indicative of the safety or 
efficacy of the therapy. 

The invention also provides a method for preventing, 
delaying, or treating a psychosis in a subject. This 
method includes (a) analyzing the MTHFR nucleic acid in a 
sample obtained from the subject, (b) determining the 
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presence of at least one MTHFR mutant allele in the 
subject that is predictive of the safety or efficacy of 
at least one ant i -psychotic therapy, (c) determining a 
preferred therapy for the subject, and (d) administering 
the preferred therapy to the subject. 

In another aspect, the invention provides a 
pharmaceutical composition including a compound which has 
a differential effect in subjects having at least one 
copy of a particular MTHFR allele and has a 
pharmaceutically acceptable carrier, excipient, or 
diluent. This compound is preferentially effective to 
treat a subject having the particular MTHFR allele. In a 
preferred embodiment, the composition is adapted to be 
preferentially effective based on the unit dosage, 
presence of additional active components, complexing of 
the compound with stabilizing components, or inclusion of 
components enhancing delivery or slowing excretion of the 
compound. In another preferred embodiment, the compound 
is deleterious to subjects having at least one copy of 
the particular MTHFR allele or in subjects not having at 
least one copy of the particular MTHFR allele, but not in 
both. In yet another preferred embodiment, the subject 
suffers from a disease or condition selected from the 
group consisting of amyotrophic lateral sclerosis, 
anxiety, dementia, depression, epilepsy, Huntington's 
disease, migraine, demyelinating disease, multiple 
sclerosis, pain, Parkinson's disease, schizophrenia, 
psychoses, and stroke. In still another preferred 
embodiment, the pharmaceutical composition is subject to 
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a regulatory restriction or reconutiendation for use of a 
diagnostic test determining the presence or absence of at 
least one particular MTHFR allele. In another preferred 
enibodiment, the pharmaceutical composition "is subject to 
a regulatory limitation or recommendation restricting or 
recommending restriction of the use of the pharmaceutical 
composition to subjects having at least one particular 
MTHFR allele. In yet another preferred embodiment, the 
pharmaceutical composition is subject to a regulatory 
limitation or recommendation indicating the 
pharmaceutical composition is not to be used in subjects 
having at least one particular MTHFR allele. 
In still another preferred embodiment, the pharmaceutical 
composition is packaged, and the packaging includes a 
label or insert restricting or recommending the 
restriction of the use of the pharmaceutical composition 
to subjects having at least one particular MTHFR allele. 
In yet another embodiment, the pharmaceutical composition 
is packaged, and the packaging includes a label or insert 
requiring or recommending the use of a test to determine 
the presence or absence of at least one particular MTHFR 
allele in a subject. Preferably, the compound is an 
ant i -psychotic therapy, such as a therapy for 
schizophrenia . 

In preferred embodiments of each of the aspects of 
the invention, the mutant MTHFR allele encodes an MTHFR 
protein with reduced activity or reduced thermal 
stability. It is also contemplated that the MTHFR 
protein may have a reduced half -life. Preferably, the 
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MTHFR mutation is a missense mutation. Preferred MTHFR 
mutations include a G/A mutation at position 167, a G/A 
mutation at position 482, a C/T mutation a position 559, 
a C/T mutation at position 677, a C/T mutation at 
position 692, a C/T mutation at position 764, a G/A 
mutation at position 792+1, a C/T mutation at position 
985, a C/T mutation at position 1015, a C/T mutation at 
position 1081, an A/C mutation at position 1298, and a 
T/C mutation at position 1317. It is also contemplated 
that the mutations at these positions may involve changes 
to other nucleotides. In other preferred embodiments, the 
subject has at least two MTHFR mutant alleles. 
Preferably the two or more mutant MTHFR alleles have at 
least one of the following: a G/A mutation at position 
167, a G/A mutation at position 482, a C/T mutation a 
position 559, a C/T mutation at position 677, a C/T 
mutation at position 692, a C/T mutation at position 764, 
a G/A mutation at position 792+1, a C/T mutation at 
position 985, a C/T mutation at position 1015, a C/T 
mutation at position 1081, an A/C mutation at position 
1298, or a T/C mutation at position 1317. In yet other 
preferred embodiments, the MTHFR mutation results in a 
change in the amino acid at position 226 of the encoded 
protein from alanine found in wild-type MTHFR to any 
other amino acid, including valine or residues other than 
valine. In still other preferred embodiments, the 
mutation results in a change in amino acid 433 of the 
encoded protein from glutamic acid to any other amino 
acid, including alanine or residues other than alanine. 
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When two mutations are present at different nucleotide 
positions, the mutations may be in the same or different 
MTHFR alleles. The MTHFR mutation may, without 
limitation, also be an insertion, deletion, or frameshift 
5 mutation. 

Preferably, the psychosis being diagnosed or treated 
is schizophrenia. The psychosis may also be any other 
psychosis such as manic-depressive disease, depression 
with psychotic features, organic psychotic disorders, 
10 psychosis in alcohol or drug intoxication, postinfection 
psychosis, postpartum psychosis, senile psychosis, 
traumatic psychosis, manic-depressive psychosis, 
psychosis from toxic agents, and acute idiopathic 
psychotic illnesses. 
15 The use of MTHFR alleles as diagnostic, therapeutic, 

prognostic, and pharmacogenomic markers may also be 
applied to other neurological disorders. Examples of 
neurological disorders include, but are not limited to, 
mood disorders, neurodegenerative diseases, cognitive 
20 disorders, seizure disorders such as epilepsy, headaches 
such as migraines, pain associated with the central 
nervous system, demyelinating diseases, spasticity, 
neural tube defects, neurofibromatosis, and ischemic 
cerebrovascular diseases such as thrombotic and 
25 hemorrhagic strokes. Mood disorders include but are not 
excluded to conditions such as unipolar depression, 
bipolar depression and anxiety. Examples of 
neurodegenerative diseases include amyotrophic lateral 
sclerosis, Parkinson's disease, acquired or symptomatic 



parkinsonism, Huntington's disease, and 
heredodegenerative disease. Examples of cognitive 
disorders include dementia, dementia of the Alzheimer's 
type, vascular dementia, multi-inf arct dementia and 
posttraumatic dementia. Demyelinating diseases consist 
of diseases such as the following: multiple sclerosis 

(MS), Marburg and Balo forms of MS, neuromyelitis optica, 
perivenous encephalitits, acute disseminated 

encephalomyelitits , 

and acute necotizing hemmorhhagic encephalitis. 

The methods of the invention may also be applied to 
non-psychiatric conditions for which ant i -psychotic drugs 
are used. Examples of these conditions include nausea, 
vomiting, movement disorders associated with 
neurodegenerative diseases such as Huntington's disease 
and Tourette's syndrome, pruritis, and chronic hiccough. 
Cardiovascular disorders, cancers, neoplastic disease and 
other related disorders, osteoporosis, neural tube 
defects, neurological disorders, disorders influenced by 
folic acid metabolism, metabolic or endocrine disease, 
inborn errors of metabolism, inflammation, immune 
disorders, psychiatric illness, and renal disease are 
also relevant to these methods. 

It is to be understood that the methods of the 
invention may be applied to any other diseases or 
disorders. Examples of other diseases or disorders may 
include those that are indirectly or directly affected by 
an increase or decrease in MTHFR activity, homocysteine 
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levels, methionine levels, S-adenosylmethionine activity, 
or methylation. 

The therapies relevant to the methods of the present 
invention include all clinically used therapies and 
5 therapies in preclinical or clinical development for the 
aforementioned diseases and disorders. Preferred 
therapies include the therapies listed in databases such 
as Life Cycle, R&D Focus, IMS World Publications, London 
UK or in any other database of therapies that are in 
10 development and/or that are FDA approved. Preferred 
ant i -psychotic therapies include conventional 
neuroleptics such as phenothiazines (e.g. 
chlorpromazine) , thioxanthenes (e.g. thiothixene), 
butyrophenones (e.g. haloperidol, one of the most useful 
15 conventional anti-psychotics) , dibenzoxazepines, 

dibenzodiazepines, diphenylbutylpiperidines, and other 
heterocyclic compounds . Preferred atypical neuroleptics 
include compounds such as Clozaril, risperidone, 
olanzapine, quetiapine, ziprasidone, amisulpride, 
20 sertindole, and iloperidone. Additionally, anti- 
psychotic therapies, such as loxapine, may have 
pharmacology intermediate between that of conventional 
and atypical neuroleptics. In preferred embodiments, the 
treatment or prevention of a neurological disease 
25 includes administration of one or more therapeutically 
active compounds that bind to a dopamine, histamine, 
muscarinic cholinergic, a- adrenergic, or serotonin 
receptors . 
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Examples of other pharmaceutically active compounds 
which may be used as anti-psychotics include synthetic 
organic molecules, naturally occurring organic molecules, 
nucleic acid molecules, biosynthetic proteins or 
peptides, naturally occurring peptides or proteins, and 
modified naturally occurring peptides or proteins. In 
preferred embodiments, the preferred therapy consists of 
an altered dose regime compared to other clinically used 
dose regimes or to the administration of a combination of 
therapeutics. It is also contemplated that the preferred 
therapy may consist of other medical treatments, such as 
surgical procedures, electroconvulsive therapy, or 
psychotherapy. Examples of psychotherapies include 
dynamic psychotherapy, cognitive -behavioral therapy, 
interpersonal therapy, behavioral therapy, group 
psychotherapy, and family therapy. For the 
administration of a therapeutically active compound to a 
subject, any mode of administration or dosing regime may 
be used. It is to be understood that for any particular 
subject, specific dosage regimes should be adjusted over 
time according to the individual need and the 
professional judgment of the person administering or 
supervising the administration of the compositions. 

Descriptions of clinically used therapies (for 
example, for psychosis) and guidelines for their use are 
readily available in common references such as the 
Physicians Desk Reference (PDR) . Additionally, 
therapeutics are described and listed in medical, 
pharmacy, and pharmacology journals and textbooks, such 
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as Goodman and Oilman's The Pharamcological Basis of 
Therapeutics, (Hardman and Limbird (eds.) Ninth Edition, 
McGraw-Hill, New York, 1996) . Additional information on 
FDA approved therapeutics and therapies in clinical 
trials for particular indications, diseases, or disorders 
may be easily found in electronic databases available 
through subscription and pharmaceutical company 
information. 

By "preferred therapy" is meant a therapy for a 
disease, disorder, or dysfunction that is efficacious, 
safe, and/or has reduced toxicity compared to another 
therapy for the disease or disorder. The preferred 
therapy selected based on one or more mutations in one or 
more MTHFR alleles is preferably effective in at least 
20, 30, 50, 70, or 90% of the subjects having the MTHFR 
mutation(s) . By "effective" is meant ablates, reduces, 
or stabilizes symptoms in a subject suffering from a 
disease, disorder, or dysfunction prevents the onset of 
symptoms in a subject at risk for a disease, disorder, or 
dysfuction, or results in a later age-at-onset of 
symptoms in the subject compared to the average age-at- 
onset for the corresponding untreated subjects with the 
same MTHFR mutation (s) . The effectiveness of the therapy 
for the treatment of a psychosis may be evaluated based 
on the Clinical Global Impression (CGI), Diagnostic 
Interview for Genetic Studies (DIGS), Global Assessment 
Score (GAS) , or Brief Psychiatric Rating Scale (BPRS) 
using standard procedures. Preferably, the total BPRS 
score is lowered by 5, more preferably 10, and most 
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preferably 15 points. By "reduced toxicity" is meant 
lower level of adverse pharmacological or physiological 
effects. Preferably, the therapy produces clinically 
unacceptable side-effects in 5, 10, 20, 30, or 50% fewer 
5 of the subjects having the MTHFR mutations (s) than in 
subjects who do not have one or ore of the MTHFR 
mutations . 

A "polymorphism" is intended to mean a mutation or 
allelic variance present in 1% or more of alleles of the 
10 general population. A polymorphism is disease-causing 

when it is present in patients with a disease but not in 
the general population. However, a polymorphism present 
both in patients having a disease and in the general 
population is not necessarily benign. The definition of a 
15 disease-causing substitution, as distinct from a benign 
polymorphism, is based on 3 factors: (1) absence of the 
change in at least 50 independent control chromosomes; 
(2) presence of the amino acid in the bacterial enzyme, 
attesting to its evolutionary significance and (3) change 
20 in amino acid not conservative. Although expression of 

the substitutions is required to formally prove that they 
are not benign, the criteria above allow us to postulate 
that the changes described in this report are likely to 
affect activity. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. lA to IF illustrate the first cDNA coding 
sequence (SEQ ID N0:1 and NO: 2) for 
methylenetetrahydrof olate reductase (MTHFR) ; 



Fig. 2 is the alignment of amino acids for human 
methylenetetrahydrof olate reductase (MTHFR) , the metF 
genes from E. Coli (ECOMETF) , and S. Typhimurium 
(STYMETF) , and an unidentified open reading frame in 
Saccharomyces cerevisiae that is divergently transcribed 
from an excision repair gene (ysRADl) ; 

Figs. 3A and 3B illustrate the sequencing and 
restriction enzyme analysis for the Arg to Ter substi- 
tution; 

Figs. 4A and 4B illustrate the sequencing and 
restriction enzyme analysis for the Arg to Gin substi- 
tution; 

Figs. 5A and 5B illustrate the sequence change 
and restriction enzyme analysis for the alanine to valine 
substitution; 

Figs. 6A to 6C illustrate the total available 
sequence (SEQ ID NO: 3 and NO: 4) of human MTHFR cDNA; 

Figs. 7A and 7B illustrate the expression 
analysis of MTHFR cDNA in B. Coli, respectively (7 A) the 
Pfestern blot of bacterial extracts and tissues, and (7B) 
the thermolability assay of bacterial extracts; 

Figs. 8A to 8D illustrate the identification of a 
5* splice site mutation leading to a 57-bp in-frame 
deletion of the cDNA; 

Figs. 9A to 9D illustrate the diagnostic 
restriction endonuclease analysis of 4 mutations; 

Figs. lOA to lOD illustrate the ASO hybridization 
analysis of 2 mutations; 

Fig. 11 illustrates the region of homology 
between human methylenetetrahydrof olate reductase (MTHFR) 
and human dihydrof olate reductase (DHFR) ; 



Figs. 12A-12B illustrate the exonic sequences of 
the human MTHFR gene with their flanking intronic 
sequences ; 

Figs. 13A-13B illustrate the exonic sequences of 
the mouse MTHFR gene with their flanking intronic 
sequences ; 

Fig. 14 illustrates intron sizes and locations 
for both human and mouse genes; and 

Fig. 15 illustrates the alignment of MTHFR amino 
acid sequences for the human MTHFR (hMTHFR) , mouse MTHFR 
(mMTHFR) and the MetF gene of bacteria (bMTHFR) . 

DETAILED DESCRIPTION OF THE INVENTION 

Sequencing of peptides from porcine MTHFR 

Homogeneous native porcine MTHFR was digested 
with trypsin to generate a 40 kDa N-terminal fragment and 
a 31 IcDa C-terminal fragment; the 31 kDa fragment is a 
proteolytic product of the 37 kDa fragment. The 
fragments were separated by SDS-PAGE, electroeluted, and 
the denatured fragments were digested with lysyl 
endopeptidase (LysC) . The resulting peptides were 
separated by reversed-phase HPLC and subjected to 
sequence analysis by Edman degradation (details contained 
in Goyette P et al., Nature Genetics, 1994, 7:195-200). 

Isolation and sequencing of cDNAs 

Two degenerate oligonucleotides were synthesized 
based on the sequence of a 30 amino acid porcine MTHFR 
peptide (first underlined peptide in Fig. 2) . These 
were used to generate a 90 bp PGR product, encoding the 
predicted peptide, from reverse transcript ion- PGR 
reactions of 500 ng pig liver polyA+ RNA. A pig-specific 
(non- degenerate, antisense) PGR primer was then 
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synthesized from this short cDNA sequence. Using this 
primer and a primer for phage arms, a human liver XgtlO 
cDNA library (Clontech) was screened by PGR; this 
technique involved the generation of phage lysate stocks 
5 (50,000 pfu) which were boiled for 5 min and then used 
directly in PGR reactions with these two primers. PGR 
fragments were then sequenced directly (Cycle Sequencing™ 
kit, GIBCO) , and a positive clone was identified by com- 
parison of the deduced amino acid sequence to the 
10 sequence of the pig peptide (allowing for inter-species 
variations) . The positive stock was then replated at 
lower density and screened with the radiolabelled 
positive PGR product by plaque hybridization until a 
well-isolated plaque was identified. Phage DNA was 
15 purified and the insert was then subcloned into pBS+ 
(Bluescript) and sequenced on both strands (Cycle 
Sequencing™ kit, GIBCO and Sequenase™, Pharmacia) . The 
deduced amino acid sequence of the human cDNA was aligned 
to the porcine peptide sequences, the metF genes from E. 
20 coli {ecometf, accession number VO1502) and S. 

Typhimurium {stymetF, accession number X07689) and with a 
previously unidentified open reading frame in 
Saccharomyces cerevisiae that is divergently transcribed 
with respect to the excision repair gene, ysRADl 
25 (accession number KO2070) . The initial alignments were 

performed using BestFit™ in the GCG computer package, and 
these alignments were adjusted manually to maximize 
homologies . 

In summary, degenerate oligonucleotide primers 
30 were designed to amplify a sequence corresponding to a 
30 -amino acid segment of a porcine peptide from the N- 
terminal region of the enzyme (first porcine peptide in 
Fig. 2) . A 90-bp porcine cDNA fragment was obtained from 



reverse transcription/PCR of pig liver RNA. Sequencing 
of the PGR fragment confirmed its identity by comparison 
of the deduced amino acid sequence to the porcine peptide 
sequence- A nondegenerate oligonucleotide primer, based 
on the internal sequence of the porcine cDNA, was used in 
conjunction with primers for the phage arms to screen a 
human liver A.gtlO cDNA library by PGR. The insert of the 
positive clone was isolated and sequenced. The sequence 
consisted of 1266 bp with one continuous open reading 
frame . 

Homology with MTHFR in other species 

The deduced amino acid sequence of the human cDNA 
was aligned with the metF genes from E. coli and S. 
typhimurium, as well as with a previously unidentified 
ORF in Saccharomyces cerevisiae that is divergently 
transcribed with respect to the excision repair gene, 
ysRADl (Fig. 2) . The sequences homologous to 5 porcine 
peptides are underlined in Fig. 2. Three segments 
(residues 61-94, 219-240, and 337-351) correspond to 
internal peptide sequence from the N-terminal 40-kDa 
domain of the porcine liver enzyme. Residues 374-393 
correspond to the upstream portion of the LysG peptide 
from the G-terminal domain of the porcine liver enzyme 
that is labeled when the enzyme is irradiated with UV 
light in the presence of (^H-methyl) AdoMet ; as predicted 
from the AdoMet labeling studies, this peptide lies at 
one. end (N-terminal) of the 37 kDa domain. A fifth 
region of homology (residues 359-372) was also 
identified, but the localization of the porcine peptide 
within the native protein had not been previously 
determined. 
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Methylenetetrahydrofolate reductase (MTHFR) is an 
enzyme involved in amino acid metabolism, that is 
critical for maintaining an adequate methionine pool, as 
well as for ensuring that the homocysteine concentration 
5 does not reach toxic levels. The high degree of sequence 
conservation, from E. coli to Homo sapiens, attests to 
the significance of MTHFR in these species. The enzyme 
in E. coli (encoded by the metF locus) is a 33-kDa 
peptide that binds reduced FAD and catalyzes the 
10 reduction of methylene tetrahydrofolate to 

methyltetrahydrof olate . The metF enzyme differs from the 
mammalian enzyme in that NADPH or NADH cannot reduce it, 
and its activity is not allosterically regulated by S- 
adenosylmethionine . The native porcine enzyme is 
15 susceptible to tryptic cleavage between the N-tenninal 40 
kDa domain and the C- terminal 37 kDa domain, and this 
cleavage results in the loss of allosteric regulation by 
adenosylmethionine, but does not result in loss of 
catalytic activity. Since the homology between the 
20 bacterial and mammalian enzymes is within the N-terminal 
domain, this region must contain the flavin binding site 
and residues necessary to bind the folate substrate and 
catalyze its reduction. The domain structure of the 
human enzyme has not been elucidated, although the human 
25 enzyme has been reported to have a molecular mass of 150 
kDa and is likely to be a homodimer of 77 kDa. 

The predicted point of cleavage between the two . 
domains lies between residues 351 and 374 of the human 
sequence, based on the localization of peptides obtained 
30 from the isolated domains of the porcine enzyme. This 

region, containing the highly charged sequence KRREED, is 
predicted to have the highest hydrophilicity and surface 
probability of any region in the deduced human sequence. 



- 29 - 



The N- terminus of the porcine protein has been 
sequenced, and the region encoding this part of the 
protein is missing from the human cDNA. It is estimated 
that this cDNA is missing only a few residues at the N- 
5 terminus, since the predicted molecular mass of the 

deduced sequence upstream of the putative cleavage site 
(KRREED) is 40 kDa, and the measured molecular mass of 
the porcine N-terminal domain is also 40 kDa. When the 
bacterial, yeast and human sequences are aligned, the 
10 deduced human sequence contains an N-terminal extension 
of 40 amino acids; it is suspected that this extension 
contains determinants for NADPH binding. Many pyridine 
nucleotide-dependent oxidoreductases contain such 
determinants at the N- terminus of the protein. 
15 The C-terminus of the human sequence contains a 

peptide that is labeled when the protein is irradiated 
with ultraviolet light in the presence of tritiated 
AdoMet . The cDNA, sequence reported here contains only 
about 7 kDa of the predicted 37 -kDa mass of this domain, 
20 indicating that this cDNA is truncated at the 3' terminus 
as well. A number of peptides from the C-terminal 
porcine domain have also not been detected. As might be 
expected, given that the prokaryotic enzymes do not 
appear to be allosterically regulated by AdoMet, there 
25 are no significant homologies between the C-terminal 

region in this cDNA and the prokaryotic metF sequences. 
The alignment shown in Fig. 2 shows that the homologous 
sequences terminate just prior to the putative cleavage 
site of the human enzyme. 



30 



Chromosomal assignment 

In situ hybridization to metaphase human chro- 
mosomes was used for localization of the human gene. The 
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analysis of the distribution of 200 silver grains 
revealed a significant clustering of grain 40 grains, in 
the p36.3-36.2 region of chromosome 1 (p<0.0001), with 
the majority of grains, 25 grains, observ^ed over lp36.3. 
5 The isolation of the human cDNA has allowed us to 

localize the gene to chromosome lp36.3. The observation 
of one strong signal on that chromosome with little 
background is highly suggestive of a single locus with no 
pseudogenes. Southern blotting of human DNA revealed 
10 fragments of approximately 10 Jcb, predicting a gene of 

average size, since this cDNA encodes approximately half 
of the coding sequence . 

Additional cDNA sequences and constructs for expression 
15 analysis 

A human colon carcinoma cDNA library (gift of Dr. 

Nicole Beauchemin, McGill University) was screened by 

plaque hybridization with the original 1.3-kb cDNA to 

obtain additional coding sequences. A cDNA of 2.2 kb was 

20 isolated, which contained 1.3 kb of overlapping sequence 

to the original cDNA and 900 additional bp at the 3' end 

(Fig. 6) . The amino acid sequence is identical to that 

of the original cDNA for the overlapping segment (codons 

1-415) except for codon 177 (ASP) which was a GLY codon 

25 in the original cDNA. Analysis of 50 control chromosomes 

revealed an ASP codon at this position. The cDNA has an 

open reading frame of 1980 bp, 100 bp of 3 • UTR and a 

poly A tail . 

Sequencing was performed on both strands for the 
30 entire cDNA. Additional 5' sequences (800 bp) were 

obtained from a human kidney cDNA library (Clontech) but 
these sequences did not contain additional coding 
sequences and were therefore used for the PCR-based 



mutagenesis only (as described below) and not for the 
expression analysis. The two cDNAs (2.2 kb and 800 bp) 
were ligated using the Eco-RI site at bp 199 and inserted 
into the Bluescript™ vector (Stratagene) . The 2.2 kb 
cDNA was subcloned into the expression vector pTrc99A 
(Pharmacia) using the Ncol site at bp 11 and the Xbal 
site in the polylinker region of both the Bluescript™ and 
the pTrc99A vectors. Sequencing was performed across the 
cloning sites to verify the wild-type construct. 

UTILITY OF INVENTION IN IDENTIFICATION OF MUTATIONS 

I. Identification of first two mutations in severe MTHFR 
deficiency 

Total RNA of skin fibroblasts from MTHFR-defi- 
cient patients was reverse-transcribed and amplified by 
PGR for analysis by the single strand conformation 
polymorphism (SSCP) method (Orita, M. et al . , Genomics, 
1989, 5:8874-8879). Primers were designed to generate 
fragments of 250-3 00 bp and to cover the available cDNA 
sequences with small regions of overlap for each fragment 
at both ends. The first mutation identified by SSCP was a 
C to T substitution at bp 559 in patient 1554; this 
substitution converted an arginine codon to a termination 
codon (Fig. 3A) . Since the mutation abolished a Fokl 
site, restriction digestion was used for confirmation of 
the change and for screening additional patients for this 
mutation; a second patient (1627) was identified in this 
manner (Fig. 3B) . The SSCP pattern for patient 1554 and 
the restriction digestion pattern for both patients was 
consistent with a homozygous mutant state or with a 
genetic compound consisting of the nonsense mutation with 
a second mutation that did not produce any detectable RNA 



(null allele) . Studies in the parents are required for 
confirmation. 

The second substitution (Fig. 4A) was a G to A 
transition at bp 482 in patient 1834 that converted an 
arginine into a glutamine residue. The substitution 
created a PstI site that was used to verify the sub- 
stitution and to identify a second patient (1863) with 
this change (Fig. 4B) . The SSCP analysis and the 
restriction digestion pattern were consistent with a 
heterozygous state for both patients. The arginine codon 
affected by this change is an evolutionarily conserved 
residue, as shown in Fig. 2. This observation, in 
conjunction with the fact that the codon change is not 
conservative, makes a strong argument that the 
substitution is a pathologic change rather than a benign 
polymorphism. Furthermore, 35 controls (of similar 
ethnic background to that of the probands) were tested 
for this substitution by Southern blotting of PstJ- 
digested DNA; all were negative. 

The family of patient 1834 was studied. The 
symptomatic brother and the mother of the proband were 
all shown to carry this substitution, whereas the father 
was negative for the change (Fig. 4B) . In the family of 
1863, the mother of the proband was shown to be a 
carrier, while the father and an unaffected brother were 
negative . 

Cell lines 

Cell line 1554 is from a Hopi male who was 
admitted at age three months with homocystinuria, sei- 
zures, dehydration, corneal clouding, hypotonia and 
Candida sepsis. Folate distribution in cultured 
fibroblasts showed a Pediococcus cerivisiae/Lactobacillus 
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casei (PC/LC) ratio of 0.52 (Control 0.14). There was no 
measurable methylenetetrahydrof olate reductase (MTHFR) 
activity (Control values = 9.7 and 15.1 nmoles/h/mg 
protein; residual activity after treatment of control 
5 extracts at 55°C for 20 min. = 28% and 31%) . 

Cell line 1627 is from a Choctaw male who pre- 
sented with poor feeding, apnea, failure to thrive, 
dehydration and homocystinuria at five weeks of age. He 
was subsequently found to have superior sagittal sinus 
10 thrombosis and hydrocephalus. The PC/LC ratio was 0.61 
and the specific activity of MTHFR was 0.1 nmoles/h/mg 
protein. There is consanguinity in that the maternal and 
paternal grandmothers are thought to be "distantly 
related" . 

15 Cell line 1779 is from a French Canadian male 

with homocystinuria who first had limb weakness, 
uncoordination, paresthesiae, and memory lapses at age 15 
years, and was wheelchair-bound in his early twenties. 
His brother (cell line 1834) also has homocystinuria, but 
20 is 37 years old and asymptomatic. Specific activity of 
MTHFR was 0.7 and 0.9 nmole/h/mg protein for 1779 and 
1834, respectively; the residual activity after heat 
treatment at 55°C was 0.9% and 0% for 1779 and 1834, 
. respectively. 
25 Cell line 1863 is from a white male who was 

diagnosed at age 21 years because of a progressive gait 
disturbance, spasticity, cerebral white matter 
degeneration, and homocystinuria. He had a brother who 
died at age 21 years of neurodegenerative disease. 
3 0 Specific activity of MTHFR in fibroblast extracts was 

1.76 nmoles/h/mg protein and the residual enzyme activity 
after treatment at 55°C was 3.6%. 



Mutation analysis 

Primers were designed from the cDNA sequence to 
generate 250-300 bp fragments that overlapped 50-75 bp at 
each end. The primer pairs were used in reverse 
transcription-PCR of 5/ig patient total fibroblast RNA. 
The PGR products were analyzed by a non-isotopic rapid 
SSCP protocol (PhastSystem™, Pharmacia) , which uses 
direct silver staining for detection of single strands. 
Any PGR products from patients showing a shift on SSGP 
gels were purified by NuSieve (FMG Bioproducts) and 
sequenced directly (Gycle Sequencing™ kit, GIBGO) to 
identify the change . If the change affected a 
restriction site, then a PGR product was digested with 
the appropriate restriction endonuclease and analyzed on 
polyacrylamide gels. To screen for the Arg to Gin 
mutation in controls, 5 ng of PstI- digested DNA was run 
on 0.8% agarose gels and analyzed by Southern blotting 
using the radiolabelled cDNA by standard techniques. 

II. Seven additional mutations at the methylene- 
tetrahydro folate reductase (MTHPR) locus with 
genotype: phenotype correlation in severe MTHFR 
deficiency 

It is reported hereinbelow the characterization 
of 7 additional mutations at this locus: 6 missense muta- 
tions and a 5' splice site defect which activates a 
cryptic splice site in the coding sequence. A preliminary 
analysis of the relationship between genotype and 
phenotype for all 9 mutations identified thus far at this 
locus is also reported. A nonsense mutation and 2 
missense mutations (proline to leucine and threonine to 
methionine) in the homozygous state are associated with 
extremely low activity (0-3%) and onset of symptoms 
within the first year. Other missense mutations 



(arginine to cysteine and arginine to glutamine) are 
associated with higher enzyme activity and later onset of 
symptoms . 

7 additional mutations at the MTHFR locus are 
described and the association between genotype, enzyme 
activity, and clinical phenotype in severe MTHFR defi- 
ciency is examined. 

Patient description 

The clinical and laboratory findings of the 
patients have been reported in the published literature. 
Residual MTHFR activity was previously measured in 
cultured fibroblasts at confluence. 

Patient 354, an African-American girl, was 
diagnosed at age 13 years with mild mental retardation. 
Her sister, patient 355 was diagnosed at age 15 years 
with anorexia, tremor, hallucinations and progressive 
withdrawal. In patient 354, residual MTHFR activity was 
19% and in her sister, 355, it was 14% of control values. 
The residual activity after heating had equivalent 
thermal stability to control enzyme. 

Patient 1807, a Japanese girl whose parents are 
first cousins, had delayed walking and speech until age 2 
years, seizures at age 6 years and a gait disturbance 
with peripheral neuropathy at age 16 years. Residual 
activity of MTHFR was 3% and the enzyme was thermolabile . 

Patient 735, an African- Indian girl, was diag- 
nosed at age 7 months with microcephaly, progressive 
deterioration of mental development, apnea and coma. 
Residual activity of MTHFR was 2% of control levels. 
Thermal properties were not determined. 

Patient 1084, a Caucasian male, was diagnosed at 
age 3 months with an infantile fibrosarcoma. He was 



found to be hypotonic and became apneic . He died at the 
age of 4 months. Residual activity of MTHFR was not 
detectable. Thermal properties were not determined. 

Patient 356, the first patient reported with 
MTHFR deficiency, is an Italian-American male who pre- 
sented at age 16 years with muscle weakness, abnormal 
gait and flinging movements of the upper extremities. 
MTHFR residual activity was 20% of control values; 
activity was rapidly and exponentially inactivated at 
55° . 

Patient 458, a Caucasian male, was diagnosed at 
age 12 years with ataxia and marginal school performance 
Residual MTHFR activity was approximately 10%, and the 
activity was thermolabile . 

Patient 13 96, a Caucasian female, was described 
as clumsy and as having a global learning disorder in 
childhood. At age 14 years, she developed ataxia, foot 
drop, and inability to walk. She developed deep vein 
thrombosis and bilateral pulmonary emboli. Residual 
activity of MTHFR was 14% and the enzyme was ther- 
molabile . 

Genomic structure and intronic primers 

Exon nomenclature is based on available cDNA 
sequence in Goyette et al . (Nature Genetics, 1994, 7:195 
2 00) . Exon 1 has been arbitrarily designated as the 
region of cDNA from bp 1 to the first intron. 
Identification of introns was performed by amplification 
of genomic DNA using cDNA primer sequences. PGR product 
that were greater in size than expected cDNA sizes were 
sequenced directly. 

Mutation detection 
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Specific exons (see Table 1 for primer sequences) 
were amplified by PGR from genomic DNA and analyzed by 
the SSCP protocol . SSCP was performed with the Phastgel*^ 
system (Pharmacia), a non-isotopic rapid SSCP protocol, 
as previously described (Goyette P et al . , Nature 
Genetics, 1994, 7:195-200), or with ^Sg-labeled PGR 
products run on 6% acrylamide: 10% glycerol gels at room 
temperature (6 watts, overnight) . In some cases, the use 
of restriction endonucleases , to cleave the PGR product 
before SSGP analysis, enhanced the detection of band 
shifts. PGR fragments with altered mobility were 
sequenced directly (GIBGO, Gycle Sequencing™ kit) . If 
the sequence change affected a restriction endonuclease 
site, then the PGR product was digested with the 
appropriate enzyme and analyzed by PAGE. Otherwise, 
allele-specif ic oligonucleotide (ASO) hybridization was 
performed on a dot blot of the PGR-amplif ied exon. 

Table 1 

PGR Primers for DNA amplification and mutation analysis 

of MTHFR 



Exon 


Primer Type 


Primer Sequence (5*->^3') 


Location 


Fragment 
Size (bp) 


1 


Sense 
Antisense 


AGCCTCAACCCCTGCTTGGAGG (SEQ ID N0:5) 
TGACAGTTTGCTCCCCAGGCAC (SEQ ID N0:6) 


C 
1 


271 


4 


Sense 
Antisense 


TGAAGGAGAAGGTGTCTGCGGGA (SEQ ID N0:7) 
AGGACGGTGCGGTGAGAGTGG (SEQ ID N0:8) 


0 
1 


198 


5 


Sense 
Antisense 


CACTGTGGTTGGCATGGATGATG (SEQ ID NO:9) 
GGCTGCTCTTGGACCCTCCTC (SEQ ID NO:10) 


1 
1 


392 


6 


Sense 
Antisense 


TGCTTCCGGCTCCCTCTAGCC (SEQ ID NO:11) 
CCTCCCGCTGGCAAGAACAAAG (SEQ ID N0:12) 


1 

1 


251 
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Table 2 

Sxjmmairy of genotypes, enzyme activity, age at onset, and 





background 


of patients 


with 


MTHFR deficiency 


Patient ® 


BPChanges^ 


Amino acid 
changes 


% 

Activity 


Age at Onset 


Background 


1807 


C764T/C764T 


Pro-^Leu/Pro->Leu 


3 


within 1st year 


Japanese 


735 


C692T/C692T 


Thr->Met/Thr->Met 


2 


7 months 


African Indian 


1084 


C692T/C692T 


Thr-»Met/Thr->Met 


0 


3 months 


Caucasian 


1554 


C559T/C559T 


Arg->Ter/Arg->Ter 


0 


1 month 


Native American 
(HopI) 


1627 


C559T/C559T 


Arg-»Ter/Arg->Ter 


1 


1 month 


Native American 
(Choctaw) 


^oo 




Arg— > Cys/Arg— >Cys 


20 


16yrs 


Italian American 


458 


C1015T/G167A 


Arg— >oys/Arg— 1 n 




11 yrs 


Caucasian 


1396 


C1081T/G167A 


Arg-^Cys/Arg->Gln 


14 


14 yrs 


Caucasian 


1779^ 


G482A/? 


Arg-^GIn/? 


6 


15 yrs 


French Canadian 


1834^ 


G482A/? 


Arg->Gln/? 


7 


Asymptomatic 
at 37 yrs 


French Canadian 


1863 


G482A/? 


Arg->Gln/? 


14 


21 yrs 


Caucasian 


354«^ 


792 + 1G->A/? 


5' splice site/? 


19 


13 yrs 


African American 


355^ 


792 + 1G->A/? 


5' splice site/? 


14 


11 yrs 


African American 



5 ^ Patients 1554, 1627, 1779, 1834 and 1863 were previously reported 
by Goyette et al . (1994) . 
^ ? = unidentified mutation. 
^ Patients 1779 and 1834 are sibs. 
^ Patients 354 and 355 are sibs. 

10 

(1) 5' splice site mutation 

Amplification of cDNA, bp 653-939, from reverse- 
transcribed total fibroblast RNA revealed 2 bands in 
sisters 354 and 355: a smaller PGR fragment (230 bp) in 
15 addition to the normal 287 bp allele (Fig. 8A) . Fig. 8A 
is the PAGE analysis of amplification products of cDNA bp 
653-939, from reverse transcribed RNA. Controls have the 
expected 287-bp fragment while patients 354 and 355 have 



an additional 230-bp fragment. Sequencing of the smaller 
fragment identified a 57-bp in-frame deletion which would 
remove 19 amino acids (Fig. 8B) . Fig. 8B is the direct 
sequencing of the PGR products from patient 354. The 57- 
5 bp deletion spans bp 73 6-7 92 of the cDNA. An almost per- 
fect 5' splice site (boxed) is seen at the 5' deletion 
breakpoint. Analysis of the sequence at the 5' deletion 
breakpoint in the undeleted fragment revealed an almost 
perfect 5' splice site consensus sequence (AG/gcatgc) . 
10 This observation suggested the presence of a splicing 
mutation in the natural 5' splice site that might 
activate this cryptic site, to generate the deleted 
allele. The sequence following the deletion breakpoint, 
in the mutant allele, corresponded exactly to the 
15 sequence of the next exon. Amplification of genomic DNA, 
using the same amplification primers as those used for 
reverse-transcribed RNA, generated a 1.2-kb PGR product 
indicating the presence of an intron. Direct sequencing 
of this PGR fragment in patient 354 identified a 
2 0 heterozygous G->A substitution in the conserved GT 

dinucleotide of the intron at the 5' splice site (Fig. 
8C) . Fig. 8G is the sequencing of the 5' splice site in 
control and patient 354. The patient carries a 
heterozygous G->A substitution in the 5' splice site 
25 (boxed) . Intronic sequences are in lower case. This 
substitution abolished a Hphl restriction endonuclease 
site which was used to confirm the mutation in the 2 
sisters (Fig. 8D) . Fig. 8D is the HphI restriction 
endonuclease analysis on PGR products of DNA for exon 4 
30 of patients 354 and 355, and of 3 controls (G) . The 198- 
bp PGR product has 2 HphI sites. The products of 
digestion for the control allele are 151, 24 and 23 bp. 
The products of digestion for the mutant allele are 175 
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and 23 bp due to the loss of a Hphl site. The fragments 
of 24 and 23 bp have been run off the gel. 

(2) Patients with homozygous coding substitutions 

SSCP analysis of exon 4 for patient 1807 revealed 
an abnormally-migrating fragment, which was directly 
sequenced to reveal a homozygous C->T substitution (bp 
764) converting a proline to a leucine residue. This 
change creates a Mnll restriction endonuclease site, 
which was used to confirm the homozygous state of the 
mutation (Fig. 9A) , Fig. 9A is the Mnll restriction 
analysis of exon 4 PGR products for patient 1807 and 3 
controls (C) . Expected fragments: control allele, 90, 
46, 44, 18 bp; mutant allele, 73 , 46 , 44 , 18 , 17 bp . An 
additional band at the bottom of the gel is the primer. 
Fifty independent control Caucasian chromosomes and 12 
control Japanese chromosomes were tested by restriction 
analysis; all were negative for this mutation. 
Homozygosity in this patient is probably due to the 
consanguinity of the parents. 

Patients 735 and 1084 had the same mutation in 
exon 4, in a homozygous state: a C^T substitution (bp 
692) which converted an evolutionarily conserved 
threonine residue to a methionine residue, and abolished 
a Nlalll restriction endonuclease site. Allele-specif ic 
oligonucleotide hybridization to amplified exon 4 (Figs. 
lOA and lOB) was used to confirm the mutation in these 2 
patients and to screen 60 independent chromosomes, all of 
which turned out to be negative. Fig. lOA is the 
hybridization of mutant oligonucleotide (6 92T) to exon 4 
PGR products from patients 735, 1084 and 30 controls. 
Only DNA from patients 735 and 1084 hybridized to this 
probe. Fig. lOB is the hybridization of normal 



oligonucleotide (692C) to stripped dot blot from A. All 
control DNAs hybridized to this probe. 

Patient 356 showed a shift on SSCP analysis of 
exon 5. Direct sequencing revealed a homozygous C— >T 
5 substitution (bp 985) which converted an evolutionarily 
conserved arginine residue to cysteine; the substitution 
abolished an Acil restriction endonuclease site. This 
was used to confirm the homozygous state of the mutation 
in patient 356 (Fig. 9B) and its presence in the 

10 heterozygous state in both parents. Fifty independent 
control chromosomes, tested in the same manner, were 
negative for this mutation. Fig. 9B is the Acil 
restriction analysis of exon 5 PGR products for patient 
356, his father (F) , his mother (M) , and 3 controls (C) . 

15 Expected fragments: control allele, 12 9, 105, 90, 68 bp; 
mutant allele, 195, 129, 68 bp. 

(3) Patients who are genetic compounds 

Patient 458 is a compound heterozygote of a 

20 mutation in exon 5 and a mutation in exon 1. The exon 5 
substitution (C— ^^T at bp 1015) resulted in the sub- 
stitution of a cysteine residue for an arginine residue; 
this abolished a Hhal restriction endonuclease site, 
which was used to confirm the mutation in patient 458 

25 (Fig. 9C) and to show that 50 control chromosomes were 
negative. Fig. 9C is the Hhal restriction analysis of 
exon 5 PGR products for patient 458 and 4 controls (G) . 
Expected fragments: control allele, 317 and 75 bp; mutant 
allele 392 bp. The 75-bp fragment is not shown in Fig. 

30 9G. The second mutation was a heterozygous G->A 
substitution (bp 167) converting an arginine to a 
glutamine residue. Allele-specif ic oligonucleotide 
hybridization to amplified exon 1 confirmed the 
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heterozygous state of this mutation in patient 458 and 
identified a second patient (13 96) carrying this mutation 
also in the heterozygous state (Figs. IOC and lOD) . Fig. 
IOC is the hybridization of mutant oligonucleotide (167A) 
5 to exon 1 PGR products from patients 458, 13 96 and 31 
controls. Fig. lOD is the hybridization of normal 
oligonucleotide {167G) to stripped dot blot from C. None 
of the 62 control chromosomes carried this mutation. 

The second mutation in patient 13 96 was identi- 
10 fied in exon 6: a heterozygous C^T substitution (bp 

1081) that converted an arginine residue to a cysteine 
residue, and abolished a Hhal restriction endonuclease 
y site. Restriction analysis confirmed the heterozygous 

substitution in 1396 (Fig. 9D) and showed that 50 control 
"ll 15 chromosomes were negative. Tooth decay does not occur in 

in patients having a saliva above pH 5.0. Fig. 9D is the 

" Hhal restriction analysis of exon 6 PGR products for 

l), patient 1396 and 2 controls (G) , Expected fragments: 

control allele, 152, 86, 13 bp; mutant allele 165, 86 bp. 

m 

20 The 13 -bp fragment has been run off the gel. 

'i J. 

(4) Additional sequence changes 

Hhal analysis of exon 6, mentioned above, 
revealed a DNA polymorphism. This change is a T->C 

25 substitution at bp 1068 which does not alter the amino 

acid (serine) , but creates a Hhal recognition site. T at 
bp 1068 was found in 9% of tested chromosomes. Sequence 
analysis identified 2 discrepancies with the published 
cDNA sequence: a G^A substitution at bp 542 which 

3 0 converts the glycine to an aspartate codon, and a C-^T 
change at bp 1032 which does not alter the amino acid 
(threonine) . Since all DNAs tested (>50 chromosomes) 
carried the A at bp 542 and the T at bp 1032, it is 
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likely that the sequence of the original cDNA contained 
some cloning artifacts. 

Genotype rpheno type correlation 

5 Table 2 summarizes the current status of muta- 

tions in severe MTHFR deficiency. In 8 patients, both 
mutations have been identified; in 5 patients (3 fami- 
lies) / only 1 mutation has been identified. Overall the 
correlation between the genotype, enzyme activity, and 

10 phenotype is quite consistent. Five patients, with onset 
of symptoms within the first year of life, had < 3% of 
control activity. Three of these patients had missense 
mutations in the homozygous state: two patients with the 
threonine to methionine substitution (C692T) and one 

15 patient with the proline to leucine substitution (C764T) . 
The nonsense mutation (C559T) in the homozygous state in 
patients 1554 and 162 7 (previously reported in Goyette P 
et al., Nature Genetics, 1994, 7:195-200) is also 
associated with a neonatal severe form, as expected. 

2 0 The other patients in Table 2 had > 6% of control 

activity and onset of symptoms within or after the 2nd 
decade of life; the only exception is patient 1834, as 
previously reported (Goyette P et al . , Nature Genetics, 
1994, 7:195-200). The three patients (356, 458 and 1396) 

25 with missense mutations (G167A, C985T, C1015T and C1081T) 
are similar to those previously reported (patients 1779, 
1834 and 1863) who had an arginine to glutamine 
substitution and a second unidentified mutation (Goyette 
P et al., Nature Genetics, 1994, 7:195-200). The sisters 

30 with the 5' splice mutation and an unidentified second 
mutation also had levels of activity in the same range 
and onset of symptoms in the second decade, but the 
activity is likely due to the second unidentified allele. 
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Discussion 

The patients come from diverse ethnic back- 
grounds. Although patients 1554 and 1627 are both Native 
Americans, the mutations occur on different haplotypes, 
suggesting recurrent mutation rather than identity by 
descent. Since the substitution occurs in a CpG 
dinucleotide, a "hot spot" for mutation, recurrent 
mutation is a reasonable hypothesis. It is difficult to 
assess whether some mutations are population-specific 
since the numbers are too small at the present time. 

MTHFR deficiency is the most common inborn error 
of folate metabolism, and a major cause of hereditary 
homocysteinemia. The recent isolation of a cDNA for 
MTHFR has permitted mutational analysis at this locus, 
with the aims of defining important domains for the 
enzyme and of correlating genotype with phenotype in 
MTHFR -deficient patients. 

The 7 mutations described here (6 single amino 
acid substitutions and a 5* splice site mutation) bring 
the total to 9 mutations identified thus far in severe 
MTHFR deficiency and complete the mutation analysis for 8 
patients. The identification of each mutation in only 
one or two families points to the striking degree of 
genetic heterogeneity at this locus. Seven of the 9 
mutations are located in CpG dinucleotides , which are 
prone to mutational events. 

5' splice site mutation 

The G^A substitution at the GT dinucleotide of 
the 5' splice site in patients 354 and 355 results in a 
57bp in- frame deletion of the coding sequence, which 
should delete 19 amino acids of the protein. This 
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deletion occurs as a result of the activation of a 
cryptic 5» splice site (AG/gc) even though this cryptic 
site does not have a perfect 5' splice site consensus 
sequence (AG/gt) . However, GC (instead of GT) as the 
first 2 nucleotides of an intron has been reported in 
several naturally-occurring splice sites, such as in the 
genes for human prothrombin and human adenine 
phosphoribosyltransf erase and twice within the gene for 
the largest subunit of mouse RNA polymerase II. The 
remaining nucleotides of the cryptic site conform to a 
normal splice site consensus sequence with its expected 
variations (Ago G79/giootioo^59^7l982t^50) • 
unlikely that the deleted enzyme resulting from this 
aberrantly- spliced mRNA would have any activity; 8 of the 
19 deleted amino acids are conserved in the E. Coli 
enzyme. Although the 2 patients show residual enzyme 
activity in the range of 20% of controls, the activity is 
probably due to the unidentified second allele in these 
patients . 

6 missense mutations 

The Arg->Cys substitution (C1081T) in patient 1396 
is within a hydrophilic sequence previously postulated to 
be the linker region between the catalytic and regulatory 
domains of MTHFR (Goyette P et al . , Nature Genetics, 
1994, 7:195-200). These 2 domains are readily separable 
by mild trypsinization of the porcine enzyme. The linker 
domain, a highly- charged region, is likely to be located 
on the outside surface of the protein and therefore more 
accessible to proteolysis. Because the Arg->Cys 
substitution converts a charged hydrophilic residue to an 
uncharged polar residue, it cannot be considered a 
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conservative change, and could affect the stability of 

the enzyme . 

The 2 Arg^Cys substitutions identified in 
patients 356 and 458 (C985T and C1015T, respectively) may 
5 be involved in binding the FAD cof actor. Previous work 
in the literature showed that heating fibroblast extracts 
at 55^, in the absence of the FAD cof actor, inactivated 
MTHFR completely. The addition of FAD to the reaction 
mixture before heat inactivation restored some enzyme 

10 activity to control extracts and to extracts from some 

patients, while the extracts of patients 356 and 458 were 
unaffected. Based on these observations, it was 
suggested that these 2 patients had mutations affecting a 
region of the protein involved in binding FAD. The 2 

15 mutations are found in close proximity to each other, 

within 11 amino acids. In patient 3 56, the Arg residue 
is evolutionarily-conserved in E. Coli and is found in a 
stretch of 9 conserved amino acids, suggesting a critical 
role for this residue; the altered Arg residue in patient 

20 458 is not evolutionarily-conserved. Crystal structure 

analysis of medium chain acyl-CoA dehydrogenase (MCAD) , a 
f lavoprotein, has defined critical residues involved in 
the binding of FAD. Two consecutive residues of the MCAD 
protein, Metl65 and Trpl66, involved in interactions with 

25 FAD, can also be identified in MTHFR, 3 and 4 amino acids 
downstream, respectively, from the Arg residue altered in 
patient 458. 

The Thr^Met substitution (C692T) is found in a 
region of high conservation with the E. Coli enzyme and 

3 0 in a region of good homology with human dihydrof olate 

reductase (DHFR) (Fig. 11) . In Fig. 11, = is identity; • 
is homology; and $ is identity to bovine DHFR enzyme. An 
asterisk (*) indicates location of Thr->Met substitution. 



Considering the early-onset phenotype of the patients, 
one can assume that the threonine residue is critical for 
activity or that it contributes to an important domain of 
the protein. This region of homology in DHFR contains a 
5 residue, Thrl3 6, which has been reported to be involved 
in folate binding. This Thr residue in DHFR aligns with 
a Ser residue in MTHFR, an amino acid with similar 
biochemical properties. The Thr -> Met substitution is 
located 8 amino acids downstream from this Ser codon, in 
10 the center of the region of homology between the 2 
enzymes. It is therefore hypothesized that the Thr 
Met substitution may alter the binding of the folate sub- 
strate . 

The G167A (Arg -> Gin) and C764T (Pro -» Leu) 
15 substitutions both affect non-conserved amino acids. 

Their importance in the development of MTHFR deficiency 
cannot be determined at the present time. All the 
mutations identified thus far are located in the 5' end 
of the coding sequence, the region thought to encode the 
20 catalytic domain of MTHFR. Mutation analysis has been 
useful in beginning to address the structure: function 
properties of the enzyme as well as to understand the 
diverse phenotypes in severe MTHFR deficiency. 

25 III. Identification of A^V mutation 

SSCP analysis and direct sequencing of PGR 
fragments were used to identify a C to T substitution at 
bp 677, which converts an alanine residue to a valine 
residue (Fig. 5A) . The primers for analysis of the A-^V 
3 0 change are: 5 ' -TGAAGGAGAA GGTGTCTGCG GGA-3 * (SEQ ID 

NO: 13) (exonic) and 5 » -AGGACGGTGC GGTGAGAGTG-3 ' (SEQ ID 
NO: 14) (intronic) ; these primers generate a fragment of 
198 bp. Fig. 5A depicts the sequence of two individuals. 
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a homozygote for the alanine residue and a hotnozygote for 
the valine residue. The antisense strands are depicted. 
This alteration creates a Hinfl site (Fig. 5B) , which was 
used to screen 114 unselected French Canadian 
5 chromosomes; the allele frequency of the substitution was 
.38. The substitution creates a Hinfl recognition 
sequence which digests the 198 bp fragment into a 175 bp 
and a 23 bp fragment; the latter fragment has been run 
off the gel. Fig. 5B depicts the three possible 

10 genotypes. The frequency of the 3 genotypes were as 
follows: -/-, 37%; +/-, 51%; and +/+, 12% (the (+) 
indicates the presence of the Hinfl restriction site and 
a valine residue) . 

The alanine residue is conserved in porcine 

15 MTHFR, as well as in the corresponding metF and stymetF 
genes of £?. Coli and S. Typhimurium, respectively. The 
strong degree of conservation of this residue, and its 
location in a region of high homology with the bacterial 
enzymes, alluded to its importance in enzyme structure or 

20 function. Furthermore, the frequency of the (+/+) 
genotype was consistent with the frequency of the 
thermolabile MTHFR variant implicated in vascular 
disease. 

25 Clinical material 

To determine the frequency of the A— >V mutation, 
DNA from 57 individuals from Quebec was analyzed by PGR 
and restriction digestion. The individuals, who were all 
French Canadian, were not examined clinically or 
30 biochemically. 

The 4 0 individuals analyzed in Table 3 had been 
previously described in Engbersen et al . (Am. J. Hum. 
Genet., 1995, 56:142-150). Of the 13 cardiovascular 
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patients, 8 had cerebrovascular arteriosclerosis and 5 

had peripheral arteriosclerosis. Five had thermolabile 

MTHFR while 8 had thermostable MTHFR (greater than 33% 

residual activity after heating) . Controls and patients 

5 were all Dutch- Caucasian, between 20-60 years of age. 

None of these individuals used vitamins which could alte: 

homocysteine levels. Enzyme assays and homocysteine 

determinations were also reported by Engbersen et al . 

{Am. J. Him. Genet., 1995, 56:142-150). 
0 Table 3 

Correlation between MTHFR genotype and enzyme activity, 
thermolability and plasma homocysteine level 





-/- 


+/- 


+/+ 




n=19 


n=9 


n=12 


specific activity^'** 
(nmol CH20/mg.protein/hr) 


22.9±1.7 
(11.8-33.8) 


15.0 ±0.8 
(10.2-18.8) 


6.9 ± 0.6 
(2.6-10.2) 


residual activity after 
heatinga-l' (o/,) 


66.8 ± 1.5 
(55-76) 


56.2 ± 2.8 
(41-67) 


21.8 ±2.8 
(10-35) 


plasma homocysteine 
(pM)(after fasting) 


12.6 ± 1.1 
(7-21) 


13.8 ± 1.0 
(9.6-20) 


22.4 ± 2.9 
(9.6-42) 


plasma homocysteine 
(^M)(post-methlonine load) 


41.3±5.0d 
(20.9 -110) 


41 ± 2.8 
(29.1-54) 


72.6 ± 11.76 
(24.4-159) 



15 ^ one-way anova p<.01 

^ paired t test for all combinations p<.01 

c paired t test p<.05 for +/+ group versus +/- group or -/- group; 

p>.05 for +/- versus -/- group. 
^ n=18 for this parameter 
20 ® n=ll for this parameter 

Enzyme activity and plasma homocysteine were 
determined as previously reported. Each value represents 
mean ± standard error. The range is given in parentheses 
below the mean. 

25 

Correlation of A^V mutation with altered MTHFR function 
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A genotypic analysis was performed and enzyme 
activity and thermolability were measured in a total of 
40 lymphocyte pellets from patients with premature 
vascular disease and controls. 13 vascular patients were 
selected from a previous study (Engbersen et al . , Am. J. 
Hum. Genet., 1995, _56: 142-150) , among which 5 were 
considered to have thermolabile MTHFR. From a large 
reference group of 89 controls, all 7 individuals who had 
thermolabile MTHFR were studied, and an additional 20 
controls with normal MTHFR were selected from the same 
reference group. Table 3 documents the relationship 
between genotypes and specific enzyme activity, 
thermolability and plasma homocysteine level. The mean 
MTHFR activity for individuals homozygous for the 
substitution (+/+) was approximately 30% of the mean 
activity for (-/-) individuals, homozygous for the 
alanine residue. Heterozygotes had a mean MTHFR activity 
that was 65% of the activity of (-/-) individuals; this 
value is intermediate between the values for (-/-) and 
(+/+) individuals. The ranges of activities showed some 
overlap for the heterozygous and (-/-) genotypes, but 
homozygous (+/+) individuals showed virtually no overlap 
with the former groups. A one-way analysis of variance 
yielded a p value <.0001; a pairwise Bonferroni t test 
showed that all three genotypes were significantly 
different with p<0.01 for the three possible 

combinations. 

The three genotypes were all significantly dif- 
ferent (p<.01) with respect to enzyme thermolability. 
The mean residual activity after heat inactivation for 5 
minutes at 46° was 67%, 56% and 22% for the {-/-), (+/-) 
and (+/+) genotypes, respectively. While the degree of 
thermolability overlaps somewhat for (-/-) individuals 
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and heterozygotes, individuals with two mutant alleles 
had a distinctly lower range. Every individual with the 
(+/+) genotype had residual activity <35% after heating, 
and specific activity <50% of that of the (-/-) genotype. 

Total homocysteine concentrations, after fasting 
and 6 hours after methionine loading, were measured in 
plasma by high performance liquid chromatography using 
fluorescence detection. Fasting homocysteine levels in 
(+/+) individuals were almost twice the value for (+/-) 
and (-/-) individuals. The differences among genotypes 
for plasma homocysteine were maintained when homocysteine 
was measured following 6 hours of methionine loading. A 
one-way anova yielded a p < .01 for the fasting and post- 
methionine homocysteine levels. A pairwise Bonferroni t 
test showed that only homozygous mutant individuals had 
significantly elevated homocysteine levels (p<.05)- 

PCR-based mutagenesis for expression of A— >V mutation in 
vitro 

PCR-based mutagenesis, using the cDNA- containing 
Bluescript™ vector as template, was used to create the A 
to V mutation. Vent™ polymerase (NEB) was used to reduce 
PGR errors- The following primers were used: primer 1, 
bp -200 to -178, sense; primer 2, bp 667 to 687, 
ant i sense, containing a mismatch. A, at bp 677; primer 3, 
667 to 687, sense, containing a mismatch, T, at bp 677; 
primer 4, bp 1092 to 1114, antisense. PGR was performed 
using primers 1 and 2 to generate a product of 887 bp, 
and using primers 3 and 4 to generate a product of 447 
bp. The two PGR fragments were isolated from a 1.2% 
agarose gel by Geneclean™ (BIO 101) . A final PGR 
reaction, using primers 1 and 4 and the first 2 PGR 
fragments as template, was performed to generate a 1 . 3 kb 
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band containing the mutation. The 1.3 kb fragment was 
digested with Ncol and Mscl , and inserted into the wild- 
type cDNA- containing expression vector by replacing the 
sequences between the Ncol site at bp 11 and the Mscl 
site at bp 943. The entire replacement fragment and the 
cloning sites were sequenced to verify that no additional 
changes were introduced by PGR. 

Expression analysis of wild- type and mutagenized cDNA 

Overnight cultures of JM105™ containing vector 
alone, vector + wild- type MTHFR cDNA, or vector + 
mutagenized cDNA were grown at 37° C. in 2 x YT media 
with .05 mg/ml ampicillin. Fresh 10 ml. cultures of each 
were inoculated with approximately 5 0 /zL of overnight 
cultures for a starting O.D. of 0.05, and were grown at 
37° C to an O.D. of 1 at 420 nM. Cultures were then 
induced for 2 hrs . with 1 mM IPTG and pelleted. The 
cells were resuspended in TE buffer with 2 /xg/ml 
aprotinin and 2/xg/ml leupeptin (3.5 x wet weight of 
cells) . Cell suspensions were sonicated on ice for 3 x 
15 sec. to break open cell membranes and then centrifuged 
for 30 min. at 4°C. to pellet cell debris and unlysed 
cells. The supernatant was removed and assayed for 
protein concentration with the Bio-Rad™ protein assay. 
Western analysis was performed using the Amersham ECL™ 
kit according to the instructions of the supplier, using 
antiserum generated against purified porcine liver MTHFR. 
Enzymatic assays were performed by established 
procedures; pre -treating the extracts at 46°C. for 5 min. 
before determining activity assessed thermolability . 
Specific activities (nmol formaldehyde/hr . /mg. protein) 
were calculated for the 2 cDNA- containing constructs 
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after subtraction of the values obtained with vector 
alone (to subtract background E. Coli MTHFR activity) . 

The MTHFR cDNA (2.2 kb) (Fig. 6) has an open 
reading frame of 1980 bp, predicting a protein of 74.6 
5 kDa. The purified porcine liver enzyme has been shown to 
have subunits of 77 kDa. Western analysis (Fig. 7A) of 
several human tissues and of porcine liver has revealed a 
polypeptide of 77 kDa in all the studied tissues, as well 
as an additional polypeptide of approximately 70 kDa in 

10 human fetal liver and in porcine liver, suggesting the 
presence of isozymes. Two [ig of bacterial extract 
protein was used for lanes 1-3. The tissues (lanes 4-8) 
were prepared by homogenization in .25M sucrose with 
protease inhibitors (2 /xg/ml each of aprotinin and 

15 leupeptin) , followed by sonication (3 x 15 sec.) on ice. 
The extracts were spun for 15 min. in a microcentrifuge 
at 14,000 g and 100 fig of supernatant protein was used 
for Western analysis. h=human; p=porcine. 

The wild- type cDNA and a mutagenized cDNA, con- 

2 0 taining the A— >V substitution, were expressed in E. Coli 

to yield a protein of approximately 70 kDa (Fig. 7A) , 
which co-migrates with the smaller polypeptide mentioned 
above. Treatment of extracts at 46°C for 5 minutes 
revealed that the enzyme containing the substitution was 
25 significantly more thermolabile than the wild-type enzyme 
(p<.001; Fig. 7B) . Two separate experiments (with 3-4 
replicates for each construct for each experiment) were 
performed to measure thermostable activity of the wild- 
type MTHFR and mutagenized MTHFR A^V cDNAs. The values 

3 0 shown represent mean ± standard error for each 

experiment, as % of residual activity after heating. The 
means of the specific activities before heating 
(expressed as nmol f ormaldehyde/hr . /mg. protein) were as 
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follows: Exp. 1, 3.8 and 5.3 for MTHFR and MTHFR A->V, 
respectively; Exp. 2, 6.2 and 7.5 for MTHFR and MTHFR A-> 
V, respectively. The expression experiments were not 
designed to measure differences in specific activity 
5 before heating, since variation in efficiencies of 
expression could contribute to difficulties in 
interpretation- Curiously though, the specific activity 
for the mutant construct was higher in both experiments. 
It is possible that the mutant protein has increased 

10 stability in E. Coli, or that inclusion bodies in the 
extracts contributed to differences in recovery of 
properly-assembled enzyme. 

These studies have identified a common substi- 
tution in the MTHFR gene which results in thermolability 

15 in vitro and in vivo. The mutation, in the heterozygous 
or homozygous state, correlates with reduced enzyme 
activity and increased thermolability of MTHFR in 
lymphocyte extracts. A significant elevation in plasma 
homocysteine was observed in individuals who were 

20 homozygous for the mutation. Statistically-significant 
differences for homocysteine levels were not observed 
between heterozygotes and (-/-) individuals; this 
observation is not surprising, since plasma homocysteine 
can be influenced by several environmental factors, 

25 including intake of folate, vitamin 6^2/ vitamin Bg, and 
methionine, as well as by genetic variation at other 
loci, such as the cystathionine-6-synthase gene. 

The alanine to valine substitution conserves the 
hydrophobicity of the residue and is associated with 

30 small changes in activity, in contrast to non- 
conservative changes, such as the previously-reported 
arginine to glutamine change in MTHFR, which is asso- 
ciated with a greater decrease in enzyme activity and 
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severe hyperhomocysteinemia - The alanine residue is 
situated in a region of homology with the bacterial metF 
genes. The same region of homology was also observed in 
the human dihydrof olate reductase (DHFR) gene (Fig. 11), 
5 although the alanine residue itself is not conserved; 

this region of amino acids 13 0-149 of DHFR contains T136 
which has been implicated in folate binding in an 
analysis of the crystal structure of recombinant human 
DHFR. It is tempting to speculate that this region in 
10 MTHFR is also involved in folate binding and that the 

enzyme may be stabilized in the presence of folate. This 
hypothesis is compatible with the well -documented 
influence of folate on homocysteine levels and with the 
reported correction of mild hyperhomocysteinemia by folic 
15 acid in individuals with premature vascular disease, and 
in individuals with thermolabile MTHFR. 

Although the cDNA is not long enough to encode 
the larger MTHFR polypeptide, it is capable of directing 
synthesis of the smaller isozyme. The ATG start codon 
20 for this polypeptide is within a good consensus sequence 
for translation initiation. Whether the isozyme is 
restricted to liver and what its role is in this tissue 
remain to be determined. 

These data have identified a common genetic 
25 change in MTHFR which results in thermolability ; these 
experiments do not directly address the relationship 
between this change and vascular disease. Nonetheless, 
this polymorphism represents a diagnostic test for 
evaluation of MTHFR thermolability in hyperhomo- 
30 cysteinemia. Large case-control studies are required to 
evaluate the frequency of this genetic change in various 
forms of occlusive arterial disease and to examine the 
interaction between this genetic marker and dietary 



factors. Well-defined populations need to be examined, 
since the limited data set thus far suggests that 
population-specific allele frequencies may exist. More 
importantly, however, the identification of a candidate 
5 genetic risk factor for vascular disease, which may be 

influenced by nutrient intake, represents a critical step 
in the design of appropriate therapies for the 
homocysteinemic form of arteriosclerosis. 

10 cDNA FOR MTHFR AND ITS POTENTIAL UTILITY 

The cDNA sequence is a necessary starting point 
for the detection of MTHFR sequence abnormalities that 
would identify individuals at risk for cardiovascular and 
neurological diseases, as well as other disorders 
15 affected by folic acid metabolism. Diagnostic tests by 

DNA analysis are more efficient and accurate than testing 
by enzymatic/biochemical assays. Less blood is required 
and results are available in a shorter period of time. 
The tests could be performed as a routine operation in 
20 any laboratory that performs molecular genetic diagnosis, 
without the specialized reagents/expertise that is 
required for an enzyme -based test. 

The second major utility of the cDNA would be in 
the design of therapeutic protocols, for correction of 
25 MTHFR deficiency. These protocols could directly involve 
the gene, as in gene therapy trials or in the use of 
reagents that could modify gene expression. 
Alternatively, the therapy might require knowledge of the 
amino acid sequence (derived from the cDNA sequence) , as 
3 0 in the use of reagents that would modify enzyme activity. 
The identification of sequences and/or sequence changes 
in specific regions of the cDNA or protein, such as FAD 
binding sites or f olate-binding sites, are useful in 



designing therapeutic protocols involving the above 
nutrients . 

UTILITY OF INVENTION IN CLINICAL AND DIAGNOSTIC STUDIES 

Coronary artery disease patients in Montreal 
(n=153) were studied to examine the frequency of the 
alanine to valine substitution. Fourteen percent of the 
patients were homozygous for this mutation. An analysis 
of 70 control individuals (free of cardiovascular 
disease) demonstrated that only seven % of these 
individuals were homozygous for the alanine to valine 
mutation. 

Analysis of homocysteine levels in 123 men of the 
above patient group indicated that the mutant allele 
significantly raised homocysteine levels from 10.2 
micromoles/L in homozygous normal men to 11.5 and 12.7 in 
heterozygotes and homozygous mutants, respectively. 

Families with a child with spina bifida, a neural 
tube defect, have been examined for the presence of the 
alanine to valine mutation. Approximately 16% of mothers 
who had a child with spina bifida were homozygous for 
this mutation, while only 5% of control individuals were 
homozygous. Fathers of children with spina bifida also 
had an increased prevalence of the homozygous mutant 
genotype (10%) as did the affected children themselves 
(13%) . 

Table 4 indicates the interactive effect of folic 
acid with the homozygous mutant alanine to valine change. 
In a study of families from Framingham, Massachusetts and 
Utah, individuals who were homozygous mutant but had 
folate levels above 5 ng/ml did not have increased 
homocysteine levels compared to individuals with the 
normal or heterozygous genotype. However, individuals 
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who were homozygous mutant but had folate levels below 5 
ng/ml had homocysteine levels that were significantly 
higher than the other genotypes. 

Table 4 

5 Mean fasting and PML homocysteine levels for different 

MTHFR genotypes 



MTHFR genotype 



Plasma 


Normals {-/-) 


Heterozygote 


Homozygotes 


Ptrend 


Homocysteine 




s (+/-) 


(+/+) 




N 


58 


61 


30 




Fasting* 


9.4 


9.2 


12.1 


0.02 


Folate <5 ng/mL 


10.2 


10.4 


15.2 


0.002 


Folate ' 5ng/mL 


8.2. 


7.5 


7.5 


0.52 


Post-Methionine load 


30.0 


30.9 


31.3 


0.62 



* significant interaction between folate levels and genotype 
(p=0.03) 



10 

Table 4 provides preliminary data for therapeutic 
intervention by folic acid supplementation to individuals 
who are homozygous for the alanine to valine change. The 
data suggest that higher levels of plasma folate would 

15 lead to normalization of homocysteine levels in mutant 

individuals and might prevent the occurrence of disorders 
ass ociated with high hom ocysteine levels, such as 
cardiovascular disease, neural tube defects, and possibly 
other disorders. Folic acid supplementation for mutant 

20 individuals might also restore methionine and S- 

adenosylmethionine levels to normal. This would be 
relevant for disorders that are influenced by 
methylation, such as neoplasias, developmental anomalies, 
neurologic disease, etc. 
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Genetic polymorphism in methylenetetrahydrof olate 
reductase (MTHFR) associated with decreased activity 

A common mutation (C677T) results in a 

thermolabile enzyme with reduced specific activity 
5 (approximately 35% of control values in homozygous mutant 
individuals) . Homozygous mutant individuals 
(approximately 10% of North Americans) are predisposed to 
mild hyperhomocysteinemia, when their folate status is 
low. This genetic-nutrient interactive effect is 

10 believed to increase the risk for neural tube defects 

(NTD) and vascular disease. There has been reported an 
increased risk for spina bifida in children with the 
homozygous mutant genotype for C677T. With the present 
invention, a second common variant in MTHFR (A1298C) , an 

15 E to A substitution has been characterized. Homozygosity 
was observed in approximately 10% of Canadian 
individuals. This polymorphism was associated with 
decreased enzyme activity; homozygotes had approximately 
60% of control activity in lymphocytes. 

20 A sequence change (C1298A) has been identified. 

Heterozygotes for both the C677T and the A1298C mutation, 
approximately 15% of individuals, had 50%-60% of control 
activity, a value that was lower than that seen in single 
heterozygotes for the C677T variant. No individuals were 

25 homozygous for both mutations. A silent genetic variant 
T1317C, was identified in the same exon. It was 
relatively infrequent (allele frequency = 5%) in the 
study group, but was common in a small sample of African 
individuals (allele frequency = 39%) . 
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in addition, by virtue of the role of MTHFR in 
folate -dependent homocysteine metabolism, the C677T 
mutation predisposes to mild hyperhomocysteinemia, a risk 
factor for vascular disease, in the presence of low 
folate status. By the present invention, the frequency 
of the A12 98C variant has been determined and its 
potential impact on enzyme function has been assessed. 

Patients with spina bifirda and mothers of 
patientfl^ere recruited from l^e Spina Bifida Clinic at 
the Montreal Children's Hosp/tal following approval from 
the institutional Review Bd4rd. Control children and 
mothers of controls were /ecruited from the same 
institution. Blood sam/es were used to prepare DNA from 
peripheral leukocytes Tto assay MTHFR activity in 
lymphocyte extracts, ind to measure total plasma 
homocysteine (tHcy)/ The presence of the C677T mutation 
(A to V) was evaluated by PCR and Hinfl digestion (2) . 
The A1298C mutat^n was initially examined by PCR and 
Mboll digestior/ (5) . The silent mutation, T1317C, was 
identified by/sSCP and sequence analysis in a patient 
with severe /MTHFR deficiency and homocystinuria. This 
patient, a/ African-American female, already carries a 
previously-described splice mutation (patient 354 (8)). 
Since tZs mutation also creates a MboII site and results 

[gestion pattern identical to that of the A1298C 
mutaV^on, distinct artificially-created restriction sites 

used to distinguish between these 2 mutations. 
Ejection of the A1298C polymorphism was performed with 
4e use of the sense primer 5 ' -GGGAGGAGCTGACCAGTGCAG-3 ' 



and the antisense primer (^-GGGGTCAGGCCAGGGGCAG-3 ' ) , 
such that the 13 8bp PCR/^agment was digested into 119bp 
and 19bp fragments b/Fnu4HI in the presence of the C 
allele. An antis^e primer (5' -GGTTCTCCCGAGAGGTAAAGATC- 
3'), which int;i?^uces a TaqI site, was similarly designed 
to identifv/^e C allele of the T1317C polymorphism. 
Together/vith a sense primer (5' -CTGGGGATGTGGTGGCACTGC- 
3'), Jfe^ie 227bp fragment is digested into 202bp and 25bp 
fjg^ments , 

Table 5 

Genotype distributions, MTHFR activity (mnol 
formaldehyde/mg protein/hour) , and total plasma 
homocysteine (tHcy;|jM) for mothers and children 



Mothers (n=141) 





E/E 


E/A 


A/A 




A/A 


AA/ 


VA/ 


A/A 


AA/ 


V/V 


A/A AN 


VA/ 


# 


24 


32 


19 


27 


26 


0 


13 0 


0 


% 

MTHFR 


17 

49.0±18.9 
114] 


23 

33.0±10.8 
[19]* 


13 
15J±4.5 
[11]* 


19 

45.0±16.0 
[15] 


18 

30.2±19.3 
[15]* 


0 


9 0 
32.U9.0 - 

m* 


0 


THcy 


9.5±3.1 
[24] 


10.0±3.2 
[32] 


12.2±7.1 
[19]** 


8.4±2.1 
[25] 


10.0±3.1 
[26] 




9.5±2.0 
[13] 






Children {n=133) 




# 


23 


43 


18 


20 


15 


1 


13 0 


0 


% 

MTHFR 


17 

52.0±17.0 
[12] 


32 

38.2±15.0 
[27]* 


13 
16.2±5.3 
[11]* 


15 
35.7±9.7 
[18]* 


11 

26.1 ±5.0 
[9]* 


1 

21.6 
[1] 


10 0 
29.5±10.3 - 
[6]* 


0 


THcy 


7.6±2.5 
[23] 


8.2 ±3.0 
[43] 


9.7±5.1 
[18] ** 


7.5±2.3 
[20] 


8.1±2.8 
[15] 


9.5 
[1] 


7.4±1.5 
[13] 





The three A1298C genotypes and the three C677T genotypes are designated by the amino 
acid codes: EE. EA. AA. and AA, AV. W, respectively. Statistical significance was assessed 
by student t-test. in comparison with EEAA values. * (p<0.05); ** (p<0.07). Standard 
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deviations are given and square brackets indicate the number of individuals for whom 
MTHFR activities and homocysteine levels were available. 

The frequencies of the three genotypes for the 
5 A1298C mutation (EE, EA and AA) were not different 

between case and control mothers, or between case and 
control children (data not shown) . Consequently, all the 
mothers and all the children were grouped together for 
analyses (Table 5) . Nine % of mothers had the homozygous 
10 AA genotype while 37% were heterozygous. This frequency 
is quite similar to the frequency of the homozygous 
mutant genotype (W) for the C677T polymorphism. In the 
MTHFR human cDNA sequence mentioned above, the cDNA 
contained the C nucleotide at bp 1298 change as a C1298A 
substitution. Since the A nucleotide is clearly the more 
frequent base at this position, the A1298C nomenclature 
was chosen. 

Since the C677T mutation (A to V) decreases MTHFR 
activity and increases homocysteine levels, the three 
20 genotype groups for the A1298C (E to A) mutation were 
further stratified by the genotype for the A to V 
mutation, to avoid the confounding influence of the 
latter polymorphism on MTHFR activity and homocysteine 
levels. The frequencies of the 9 genotypes, with MTHFR 
25 activity and homocysteine levels for each genotype, are 
shown in Table 5. If the mothers and children without 
either mutation i.e. EE/AA are used as the reference 
(control) group, the mothers and children that are 
homozygous for the A12S8C mutation (AAAA) have 
30 approximately 65% and 57%, respectively, of control MTHFR 
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activity. Heterozygotes for the C677T change alone 
(EEAV) have approximately 70% of control activity, as 
reported in other studies, while double heterozygotes 
(EAAV) , 18% of mothers and 11% of children, have an 
5 additional loss of activity (approximately 62% and 50% of 
control values, respectively) . 

Homocysteine levels were not significantly 
increased by the A1298C mutation, but homocysteine was 
elevated (with borderline significance, p<0.07) in 
10 mothers and children who were homozygous for the C677T 
change. The small number of individuals who were 
homozygous for the A1298C mutation (n=13) may have 
influenced the power of the statistical analyses and 
precluded an investigation of the genetic -nutrient 
15 interactive effect that leads to mild 

hyperhomocysteinemia, as seen in individuals with the 

C677T mutation. 

The T1317C substitution does not alter the amino 
acid (phenylalanine) and is likely a benign change, 
20 although a splicing defect cannot be ruled out at the 
present time. In an evaluation of 38 control mothers 
from this study, 2 were found to be heterozygous and one 
was identified as a homozygote, resulting in an allele 
frequency of 5% (4/76) . Since this substitution was 
25 identified in an African-American female, control African 
individuals were also examined (n=9) . Seven of these 
were heterozygous, resulting in an allele frequency of 
39% (7/18) . 
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The A12 98C mutation clearly reduces MTHFR 
activity, albeit to a lesser extent than the C677T 
mutation. Consequently its effect on homocysteine levels 
is also attenuated and, in fact, may only be significant 
when an individual carries both mutations and/or has poor 
nutrient status. However, since double heterozygotes are 
estimated to represent approximately 15% of the 
population, this variant should be examined in 
conjunction with the C677T variant in studies of 
hyperhomocysteinemia . 

The A1298C mutation is clearly polymorphic in 
Canadian individuals and should be examined in other 
populations. The A nucleotide is likely to be the 
ancestral sequence since it represents the more common 
allele, although the original human MTHFR cDNA sequence 
(GenBank accession number U09806) carried the C 
nucleotide. This polymorphism is similar in frequency to 
the C677T polymorphism. Presumably the two substitutions 
arose separately on a A1298/C677 or E/A haplotype, since 
the haplotype with both substitutions (C1298/T677 or A/V) 
is extremely rare. One such haplotype was seen in a 
child with the EAW genotype, suggesting a recombinant 
chromosome . 

Doubly homozygous individuals (AAW) were not 
observed in this study. Since the double mutation in cis 
is rare, it is possible that not enough alleles were 
studied. Larger studies in other populations might 
result in the identification of these individuals. 
Presumably the MTHFR activity would be even lower and 



homocysteine levels might be higher than those observed 
thus far. 

The C677T polymorphism in exon 4 is within the N- 
terminal catalytic domain of the enzyme whereas the 
A1298C polymorphism in exon 7 is within the C-terminal 
regulatory domain. The more dramatic effect on enzyme 
activity with the first polymorphism may be a consequence 
of its location within the catalytic region. The second 
polymorphism could affect enzyme regulation, possibly by 
S-adenosylmethionine, an allosteric inhibitor of MTHFR, 
which is known to bind in the C-terminal region. 

Many studies have examined the effects of the 
C677T polymorphism on MTHFR enzyme activity and on 
homocysteine levels. Although the correlation between 
the presence of this substitution and decreased enzyme 
activity/increased homocysteine levels has been quite 
good, the variability in results, particularly in 
heterozygous individuals, may reflect the presence of a 
second common variant in the population. 

The third variant, T1317C, was present on 5% of 
alleles in Canadian individuals but appears to be 
extremely common in individuals of African ancestry. The 
methodology outlined in this report should be used to 
assess the frequency of the A12 98C and T1317C in other 
populations, since the use of the MboII restriction site 
for analysis of the A12 98C change, as first reported, 
would not discriminate between the 2 polymorphisms. 



The C677T mutation is a risk factor for 
hyperhomocysteinemia and has been implicated in both 
neural tube defects and vascular disease. 

Gene structure of human and mouse 
methylenetetrahydrofolate reductase (MTHFR) 

A human cDNA for MTHFR, 2.2 kb in length, has 

been expressed and shown to result in a catalytically- 
active enzyme of approximately 70 kDa. Fifteen mutations 
have been identified in the MTHFR gene: 14 rare 
mutations associated with severe enzymatic deficiency 
and one common variant associated with a milder 
deficiency. The common polymorphism has been implicated 
in three multifactorial diseases: occlusive vascular 
disease, neural tube defects and colon cancer. The human 
gene has been mapped to chromosomal region lp36.3 while 
the mouse gene has been localized to distal Chromosome 4. 
The isolation and characterization of the human and mouse 
genes for MTHFR is herein reported. A human genomic 
clone (17 kb) was found to contain the entire cDNA 
sequence of 2.2 kb; there were 11 exons ranging in size 
from 102 bp to 432 bp. Intron sizes ranged from 250 bp 
to 1.5 kb with one exception of 4.2 kb. The mouse 
genomic clones (19 kb) start 7 kb 5' to exon 1 and extend 
to the end of the coding sequence . The mouse amino acid 
sequence is approximately 90% identical to the 
corresponding human sequence. The exon sizes, locations 
of intronic boundaries, and intron sizes are also quite 
similar between the two species. The availability of 
human genomic clones has been useful in designing primers 
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for exon amplification and mutation detection. The mouse 
genomic clones may be used to make constructs for gene 
targeting and generation of mouse models for MTHFR 
deficiency. 

A common polymorphism, C677T has been identified, 
which converts an alanine codon to valine (Frosst et al . , 
1995) . This common polymorphism, which is present on 
approximately 35% of alleles in the North American 
population, encodes the thertnolabile variant and 
predisposes to mild hyperhomocysteinemia when folate 
status is low (Frosst et al . , 1995; Jacques et al . , 1996; 
Christensen et al . , 1997). This genetic -nutrient 
interactive effect is believed to be a risk factor for 
arteriosclerosis (Frosst et al, 1995) and neural tube 
defects. In contrast, the mutant homozygous genotype may 
decrease the risk for colon cancer. 

The characterization of the genomic structure for 
human MTHFR is reported herein. The corresponding 
analysis of the mouse gene, with a comparison of the 
overall organization of the gene and the amino acid 
sequences in these two species, is also shown. 
Screening of genomic libraries 

Genomic libraries were screened using standard 
methods of plaque hybridization. The 2.2 kb human cDNA 
was radiolabelled and used as a probe in screening both 
human and murine genomic libraries. Screening for the 
human gene was performed on a phage library of partial 
EcoRI digestion fragments from total genomic DNA (ATCC # 
37385) , and on a phage library of chromosome 1-specific 
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complete EcoRI digestion fragments {ATCC# 57738) . 
Screening for the mouse gene was performed on a XDASH 
library of partial Sau3A digestion fragments from total 
genomic DNA of mouse strain 129SV (obtained from Dr. J. 
Rossant, University of Toronto) . Positive clones were 
purified by sequential rounds of screening and isolation, 
and phage DNA was isolated using phage DNA isolation 
columns (QIAGEN) . Human clones were digested with EcoRI 
to release the inserts, and then with Xbal to facilitate 
cloning into Bluescript plasmid (Stratagene) . The mouse 
clones were digested with Sail or EcoRI, and the inserts 
were subcloned into Bluescript. 
Characterization of mouse cDNA sequences 

Mouse genomic clones were sequenced (Sequenase 
kit, Amersham) using human cDNA primers spanning most of 
the available 2.2 kb cDNA. These sequences were then 
used to generate mouse -specific cDNA primers. The mouse- 
specific primers were used in PGR amplification of 
overlapping cDNA fragments from reverse -transcribed mouse 
liver RNA. The PGR products were subcloned into the 
PGRII vector (Invitrogen) and sequenced. Two different 
species of mouse (G57B1/6J and ct) were used to generate 
MTHFR sequence by RT-PGR, to ensure that the PGR protocol 
did not generate sequencing errors. 

Characterization of intron boxindaries and sizes, and 
restriction analysis of human and mouse genes 

Primers from cDNA sequences of human and mouse 

were used to sequence the respective genomic clones. 
Intron boundaries were determined from regions of 
divergence between cDNA and genomic clone sequences, and 
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by the identification of splice acceptor and donor 
consensus sites. Intronic sequences were obtained for 
40-50 bp from the junctions and are shovm in Figs. 12A- 
12B (human) and Figs. 13A-13B (mouse). The same cDNA 
primers were used in PGR amplification of total genomic 
DNA and of genomic clones to determine the approximate 
sizes of introns in the human and mouse genes. Table 6 
lists the locations and approximate sizes of introns for 
both species. The PGR products were analyzed by 
restriction enzyme digestion to generate a preliminary 
restriction map of the gene. This restriction map was 
then confirmed by restriction analysis of the genomic 
clones in Bluescript . 

Referring to Figs. 12A-12B, the bp location of 
the exons within the cDNA, in parentheses, is based on 
the published human cDNA sequence (GenBank accession 
number U09806) . Bp 1 is 12 bp upstream from the ATG in 
the original cDNA; an asterisk indicates the equivalent 
base here. Exon 1 contains the ATG start site 
(underlined) , and exon 11 contains the termination codon 
(underlined) . Uppercase characters indicate exonic 
sequences, and lower case characters are intronic. 
Gonsensus splice junction sequences are underlined. The 
3' boundary of exon 11 has been designated by the 
location of the polyA tail. 

Referring to Figs. 13A-13B, the bp location of 
the exons within the cDNA, in parentheses, is based on 
the equivalent bp 1 of the human sequences in Figs. 12A- 
12B (bp 1 is indicated by an asterisk) . Exon 1 contains 
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the ATG start site (underlined) , and exon 11 contains the 
termination codon (doubly underlined) . Uppercase 
characters indicate exonic sequences, and lower case 
characters are intronic . Consensus splice junction 
sequences are underlined. The 3' boundary of exon 11 is 
designated as the termination codon, since the site of 
polyadenylation is unknown. Also underlined in exon 11 
is the first repeat of the 52 bp repeated element. 

Referring to Fig. 14, exon sizes for human and 
mouse are reported in Figs. 12A-12B and Figs. 13A-13B, 
respectively. Exons are indicated in shaded boxes. 
Uncharacterized regions of the gene are hatched, and exon 
numbering corresponds to Figs. 12A-12B and 13A-13B. 
E=EcoRI; X=XbaI; An=polyadenylation site. The EcoRI 
restriction site at the 5' end of the mouse gene is part 
of the phage polyl inker sequence. 

Referring to Fig. 15, residues that are identical to 
the human MTHFR sequence are shown as empty boxes, and 
gaps in amino acid homology are represented by a dash. 
Human genomic clones 

The genomic clones isolated from the human 
libraries contained a 16 kb EcoRI fragment, encompassing 
part of exon 1 and exons 2 through 11, and a 1 kb EcoRI 
fragment containing most of exon 1. Exon 1 is defined as 
the most 5' exon from the previously published cDNA 
sequence; it contains the ATG start site that was used to 
express the human cDNA in bacterial extracts (Frosst et 
al. 1995) . A graphic representation of the human gene 
and its restriction map are depicted in Fig. 3. The 
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sequences of each exon and 50 bp of flanking intronic 
sequences are shown in Figs. 12A-12B. Exons range in 
size from 102 bp to 432 bp, and the critical 
dinucleotides in the 5' and 3' splice site consensus 
5 sequences (GT and AG, respectively) are underlined. The 
3' boundary of exon 11 is defined by the site of 
polyadenylation in the cDNA; a possible polyadenylation 
signal (AACCTA) is present 15 bp upstream of the 
polyadenylation site, although it varies from the 
10 consensus sequence. Table 6 lists the locations and 
approximate sizes of introns as determined by PGR 
amplification; the introns range in size from 
approximately 250 bp to 4.2 kb. 
Mouse genomic clones 
15 Genomic clones isolated from the mouse libraries 

were digested with EcoRI for subcloning and 
characterization. Exon nomenclature is based on the 
corresponding human gene sequences. Figs. 13A-13B list 
all known exons and their sizes, with 40 to 50 bp of 
20 flanking intronic sequences. Fig. 14 shows a graphic 
representation of the mouse genomic structure aligned 
with the human gene. The size of exon 11 is 
undetermined, since the sequence for this region was 
determined directly from the genomic clones. The 
25 termination codon is located within a region of 52 bp 

which is repeated 3 times in the gene. The significance 
of this, if any, is unknown at the present time. The 
dinucleotides of the splice junctions (underlined in 
Figs. 13A-13B) are in agreement with consensus sequences. 



Table 6 lists the approximate sizes of introns as 
determined by PGR, and their bp location in the cDNA. 
The introns range in size from approximately 250 bp to 
4.2 kb. 

Comparison of the himan and mouse genes 

The human and mouse genes are very similar in 
size and structure (Fig. 14). The introns are similar in 
size, and identical in location within the coding 
sequence. However, the mouse cDNA is one amino acid 
shorter in exon 1 which causes a shift in bp numbering of 
the mouse cDNA (Table 6/ Figs. 13A-13B) . Exon 1 was 
defined from the original published human cDNA, based- on 
the presence of a translation start codon. In both human 
and mouse genes, the 5' boundary of exon 1 was assigned 
after the isolation of several non-coding cDNA 
extensions that are generated by alternative splicing 
from this junction. Characterization of these 5' cDNA 
extensions is in progress. The nucleotide sequences of 
the human and mouse genes are very similar within coding 
regions, but homology decreases dramatically in the 3' 
UTR region and within introns. 

Human and mouse primary amino acid sequence homology 

The primary amino acid sequences of human and mouse 
were compared to each other, and aligned with the 
sequence of the MetF (MTHFR) enzyme from bacteria (Fig, 
15) . The human and mouse amino acid sequences are almost 
90% identical. As previously observed, only the 5' half 
of the mammalian sequences align with the bacterial 
enzyme; bacterial MTHFR has the same catalytic activity 
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as the mammalian enzyme but lacks the regulatory region 
in the C- terminal domain. The murine amino acid sequence 
is two amino acids shorter than the human sequence: one 
less amino acid in exon 1 and one less in exon 11. 

The isolation of the human MTHFR gene and the 
analysis of gene structure are part of an ongoing effort 
to study MTHFR deficiency in homocystinuria and in 
multifactorial diseases. The availability of genetic 
structure information and of intronic sequences will help 
in the mutational analysis of patients suffering from 
MTHFR deficiency and in the characterization of the 5' 
regulatory region. 

Expression analysis of the 2.2kb cDNA in a 
bacterial expression system resulted in a catalytically- 
active VOkDa protein (Frosst et al . 1995). A MTHFR 
polypeptide of this size was observed in some human 
tissues on Western blots, but a larger isozyme (77 kDa) , 
corresponding to the estimated size of the porcine 
polypeptide, was observed in all the examined tissues. 
These data suggested the presence of protein isoforms for 
MTHFR that could be tissue-specific (Frosst et al . , 
1995) . Since human or mouse sequences homologous to the 
N-terminal porcine amino acid sequences have not been 
identified, it is assumed that the missing sequences 
required to encode the larger isoform are 5' to the 
available cDNA sequences. Two mRNAs for human MTHFR 
(approximately 7.5 and 8.5 kb) have been seen in all 
tissues on Northern blots (data not shown) , suggesting 
very large UTRs. The isolation of 5' coding sequences 
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has been complicated by the presence of several 
alternatively- spliced 5' non-coding extensions that 
splice into exon 1. The alternative splicing into exon 1 
has been observed in both human and mouse MTHFR. The 
long UTRs and the alternative splicing events suggest 
that the regulation of this important gene may be quite 
complex. 

Nonetheless, the available information has been 
critical for identification of mutations in patients with 
various forms of MTHFR deficiency. The mouse sequences 
in exons 1 and 2 have been useful in the design of 
antisense oligonucleotides to successfully inhibit MTHFR 
in mouse embryo cultures and disrupt development of the 
neural tube (Lanoue et al . 1997). The isolation and 
characterization of mouse genomic clones is essential for 
construction of targeting vectors to generate mouse 
models for MTHFR deficiency. 
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Table 6: Approximate sizes of introns, and their 
locations in human and mouse MTHFR cDNA 



Intron 



Approximate 
size (kb) 



Human 
location^ 



Mouse 
location^ 



1 


1 . b 


^ 1 O ^ ^ 


245-246 


2 


0 . 8 


487-488 


484-485 


3 


4.2 


598-599 


595-596 


4 


0.8 


792-793 


789-790 


5 


0 .35 


1043-1044 


1040-1041 


6 


0 .25 


1178-1179 


1175-1176 


7 


0.3 


1359-1360 


1356-1357 


8 


1.5 


1542-1543 


1539-1540 


9 


1.3 


1644-1645 


1641-1642 


10 


0.3 


1764-1765 


1761-1762 

•3 nr»1 H H 12 



5 upstream from the ATG, as in the original report of the cDNA 
sequence (Goyette et al . 1994). 

Various doses of methotrexate (a drnag used in 
treatment of cancer and arthritis, possibly other 

10 diseases) were added to colon carcinoma lines in culture. 
Much lower doses of methotrexate are needed to kill the 
lines that carry the 677C/T mutation in MTHFR, compared 
to lines that do not carry this mutation. The IC50 
(concentration needed to kill half the cells) is 

15 approximately 20 nM for lines with the mutation and 

approximately 150 nM for lines without the mutation. To 
extrapolate to the human condition, patients with this 
MTHFR mutation might require lower doses of methotrexate 
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for therapy, or might be subject to methotrexate toxicity 
at high doses. 

Table 7 
Stunmary of BMD analysis 



Entire cohort: Values are means +/- SE 





Genotype 


1/1 


Genotype 


1/2 


Genotype 


2/2 


Spinal Z score 


-1.06 (0. 


193) 


-1.25 (0. 


176) 


-1.86 (0. 


238) 


Femoral neck Z score 


-0.69 (0. 


134) 


-0.78 (0. 


126) 


-0.87 (0. 


228) 


Trochanter Z score 


-0.60 (0. 


142) 


-0.52 (0. 


134) 


-1.15 (0. 


249) 


Ward's triangle Z score 


-0.67 (0. 


147) 


-0.80 (0. 


139) 


-0.96 (0. 


257) 
litipH fnr 



Table 7 indicates the values for bone mineral density in a group of individuals who were examined 
the MTHFR C677T mutation. 
Genotype 1/1 = normal C/C 
Genotype 1/2 = earners C/T 
Genotype 2/2 = homozygous mutant T/T 



As seen on Table 7, the lower the score, the 
lower the bone mineral density and therefore the higher 
the risk for osteoporosis. The results suggest that the 
homozygons mutant genotype (2/2) is asssociated with 
lower bone mineral density and therefore higher risk of 
osteoporosis . 

Diagnosis and Therapeutic Outcome of a Psychosis 

MTHFR alleles are also useful as diagnostic, 
therapeutic, prognostic, and pharmacogenomic markers for 
neurological disorders such as psychoses and in other 
diseases and disorders that may be treated with anti- 
psychotic therapeutics. 
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Psychoses are serious, debilitating mental illnesses 
which are characterized by loss of contact with reality 
and may also include any of the following symptoms: 
delusions, hallucinations, thought disorder, strange 
5 behavior or emotions, or withdrawal from social contact. 
The duration of untreated psychosis may be correlated 
with worsened patient -prognosis . 

Schizophrenia is the most common psychosis, which 
affects approximately 2 million Americans. Although 

10 genetic factors are known to play a role in the etiology 
of schizophrenia, identification of susceptibility genes 
has been difficult. This difficulty may be due partially 
to the variability of the schizophrenia phenotype, 
including variations in severity of symptoms, response to 

15 medications, and long-term outcome. Indeed, one third to 
one quarter of schizophrenic subjects remain severely 
symptomatic despite multiple trials with individual or 
combinations of ant i -psychotic therapeutics, also called 
neuroleptics. This variability may reflect factors such 

2 0 as genetic heterogeneity or the presence of modifier 

genes. Compared to neuroleptic nonresponders, the group 
of responders is characterized by a high female to male 
ratio, better long-term outcome, and more frequently 
disturbed indices of dopamine neurotransmission. 

25 In addition to the inability to reduce symptoms in a 

substantial number of schizophrenia patients, current 
neuroleptic therapeutics also cause a variety of serious 
side effects. Examples of the adverse effects of 
conventional neuroleptics include cardiac conduction 
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abnormalities can result in patient deaths, seizures, 
involuntary repetitive movements (tardive dyskinesia) , 
dulling of cognition, sedation, weight gain, sexual or 
reproductive dysfunction, blood dyscrasias, jaundice, 
5 skin reactions such as uticaria and dermatitis, and 
extrapyramdial effects. Extrapyramdial side effects 
involve conditions such as abnormal facial expressions, 
restlessness or constant movement, and rigidity and 
tremor at rest (parkinsonian syndrome) . Because of the 
10 severity of these side effects and the low therapeutic- 
to- toxic index of conventional neuroleptics, other 
neuroleptics, called atypical neuroleptics, have been 

^ recently developed. Atypical neuroleptics have a lower 

hi 

|j, incidence of extrapyramidal symptoms and tardive 

15 dyskinesia; however, they are still associated with 
yi weight gain and effects on blood pressure and liver 

J.^ function, as observed for conventional neuroleptics, 

ftl Due to the severity and heterogeneity of 

schizophrenia, improved methods are needed for diagnosing 
J'=';- 2 0 people at risk for or suffering from schizophrenia. 

Improved methods are also desirable for determining a 
preferred therapy for a particular diagnosed individual 
or group of individuals. The selection of a preferred 
therapy for a particular subject may prevent the subject 
25 from having to endure possibly irreversible side effects 
from therapies that are ineffective for that subject. 
Reduced side effects of the preferred therapy compared to 
other therapies may result in greater patient compliance. 



further increasing the likelihood of therapeutic benefit 
from the therapy. 

We have discovered that the heterozygous form of the 
C677T mutation in methylenetetrahydrof olate reductase 
(MTHFR) is more common in patients diagnosed with 
schizophrenia than in a healthy control population and 
thus is a risk factor for this disease. In addition, 
both the heterozygous and homozygous forms of this 
mutation are associated with an improved response to 
neuroleptic treatment and long-term outcome. 

Because the C677T mutation results in decreased 
MTHFR activity and increased enzyme thermolability, other 
MTHFR genotypes or haplotypes having similar 
characteristics may be associated with schizophrenia or 
improved response to neuroleptics. For example, another 
mutation of human MTHFR, A1298C which converts glutamic 
acid to alanine, also results in reduced activity in the 
encoded protein, and individuals who are heterozygous for 
both the C677T and A12 98C mutations have even lower MTHFR 
activity and have higher homocysteine levels than 
observed in individuals with the single C677T mutation. 
Other similar mutations include, but are not limited to, 
G167A, G482A, C559T, C692T, C764T, a G/A G792+1A, C985T, 
C1015T, C1081T, T1317C, and any combination of these 
MTHFR mutations. 

In this study, two groups of schizophrenic patients 
selected on the basis of a priori defined criteria of 
long-term outcome and response to neuroleptics (excellent 
responders and very poor responders. Table 8) and a 
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control group of health volunteers were compared with 
regard to the C-to-T substitution at nucleotide 677 of 
the MTHFR gene, called "V allele." 

Table 8 



Demographic and clinical characteristics of patients and 

controls 





Nonresponders 


Responders 




(n=62) 


(n=43) 


Age (yr.) ± SD 


38.7 ± 6.9 


40.6 ± 10.5 


Gender (% Males) 


74 


67 


Age at 1 C (yr . ) 


18.1 ± 3.9 


24.2 ± 4.8** 


Subtype, U/P/D/C 


26/30/4/1 


6/36/1/0** 


Time as in-patient {%) 


63.5 ± 38 


8.2 ± 11.8** 


BPRS score 


48.9 ± 9.0 


24.4 ± 3.9** 



Age at 1 C = age at first contact with a psychiatric care faciUty, U = undifferentiated, 
P = paranoid, D= disorganized, C = catatonic, BPRS = Brief psychiatric rating scale. 



The heterozygous and homozygous C677T mutations were found more 
frequently in schizophrenia patients than in control subjects (Table 9). In 
addition, there was a significant association between schizophrenia and the 
presence of at least one C677T allele of the MTHFR gene (x^ = 5.44, degrees 
of freedom (df) = 1, p = 0.019). This association was due to an 
over-representation of allele V in neuroleptic responder patients compared to 
controls (x^ = 16.77, df = 1, p = 0.00004); non-responder patients did not 
differ from controls. In addition, the attributable fraction associated with 
carrying at least one V allele in the group of neuroleptic responder patients was 
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75%, meaning that 7 5% of the schizophrenia cases may be 
due to this mutation. The association between responder 
patients and the C677T mutation was statistically 
stronger in males than in females. 

The association between the C677T mutation and 
schizophrenia was observed for both a recessive (W vs. 
AV+ AA) as well as a dominant model (W + AV vs . AA) of 
transmission of this mutation; however, the latter showed 
consistently more power to discriminate between 
responsive patients and controls (Table 9) . 

These results indicate that good response to 
neuroleptic medication and good long-term outcome 
characterize the schizophrenia phenotype associated with 
allele V of the MTHFR gene. The fact that no association 
with the V allele was observed in the groups of 
neuroleptic nonresponder patients indicates, as 
previously suggested, that these two groups of patients 
are at least partially different from a pathogenic point 
of view. The dosage effect of allele V on age-at-onset , 
a marker of severity of illness that is independent from 
responsiveness to neuroleptics, suggests that this 
association is possibly mediated by a schizophrenia 
subtype. Thus, the classification of schizophrenic 
patients by the presence or absence of the C677T mutation 
may be an effective strategy to reduce heterogeneity for 
studies of the treatment or biology of schizophrenia. 
Additionally, the association of allele V with the 
ability of neuroleptic treatment to reverse psychotic 
symptoms and possibly the deterioration associated with 
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untreated psychosis may be due to the association of 
allele V with an ability to respond to neuroleptics that 
is independent of schizophrenia subtype. 

While not meant to limit the invention to a 
particular theory, it also possible the V allele is 
associated with a disturbance in the one-carbon cycle 
metabolism regulated by MTHFR, since MTHFR provides the 
carbon group for the generation of S-adenosyl methionine, 
a universal methyl donor. A disturbance of one-carbon 
metabolism may result in a deficiency of methylation 
reactions that are required for numerous pathways 
including detoxification, DNA methylation and regulation 
of biogenic amine metabolism. Another possibility is 
that the V allelic variant may act as a moderator of the 
severity of schizophrenia by increasing the plasma 
concentration of homocysteine. Indeed it has been 
reported that a homocysteine load causes worsening of 
psychotic symptoms in schizophrenic patients. 

The association of other MTHFR alleles with 
schizophrenia, or any other disease, or with response to 
particular therapies may be determined by any standard 
methods (see, for example, those methods described in WO 
00/04194; Ueda et al . , Circulation 98:2148-2153, 1998, 
Tan et al . , Lancet 350:995-999, 1997; Poirier et al . , 
Proc. Natl. Acad. Sci. USA 92:12260-12264, 1995; Spire- 
Vayron de la Moureyre et al . , British J. of Pharmacology 
125:879-887, 1998; Drazen et al., Nature Genetics 22:168- 
170, 1999; Smeraldi et al . , Molecular Psychiatry 3:508- 
511, 1998; Kuivenhoven et al. New Eng. J. of Med. 



- 84 - 



338 (2) : 86-93 , 1998; Aarranz et al . , Molecular Psychiatry 
3:61-66, 1998) . Haplotyping may also be used to 
determine associations between particular mutations or 
combination of mutations and particular diseases or 
responses to particular therapies. 

Clinical trials for an ant i -psychotic therapy may be 
conducted as described below or using any other 
appropriate method. The subjects in the trial may be 
stratified based upon their MTHFR genotype at the 
relevant polymorphic sites. This stratification may be 
used to correlate an MTHFR genotype or haplotype with the 
efficacy or the safety of the therapy and may be 
performed before, during, or after the clinical trial. 
The correlation of the safety and efficacy of the therapy 
in subjects with a particular MTHFR mutation or 
combination of MTHFR mutations may be considered during 
the Food and Drug Administration (FDA) approval process 
and used to determine preferred therapies for the 
different MTHFR subgroups of subjects. 

Briefly, clinical trials generally consist of four 
phases: Phase I is a safety study generally involving 
healthy volunteers; Phase II is a dose-ranging, efficacy 
study involving subjects diagnosed with a condition, and 
Phase III is a larger safety and efficacy study. If the 
efficacy data from the Phase III trial are statistically 
significant, the therapy may be approved by the FDA. 
Phase IV is a post -marking approval trial that may be 
used to further refine the approved indication population 
or to compare one therapy to another one. This process 
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for testing a therapy in humans may also be modified by 
one skilled in the art depending on specific 
pharmacokinetic or pharmacodynamic properties of the 
therapy or the availability of subjects diagnosed with 
the condition. For example, cytotoxic therapies, such as 
cancer chemotherapeutics , are not tested in healthy 
control subjects and generally begin clinical trials with 
a Phase 11 study. 

Clinical trials may be performed using standard 
protocols (see, for example, Spilker, Guide to Clinical 
Trials, Raven Press, 1991; Tygstrup, ed., The Randomized 
Clinical Trial and Therapeutic Decisions Marcel Dekker; 
Thall, ed. Recent Advances in Clinical Trial Design and 
Analysis, Cancer Treatment and Research, Ctar 75, Kluwer 
Academic, 1995; and Piantadosi, Clinical Trials: A 
Methodologic Perspective, Wiley Series in Probability and 
Statistics, 1997) . Examples of possible clinical trials 
protocols include placebo-controlled, single blinded, 
double blinded, crossover, and other studies. 

For the design of clinical trials to test the 
contribution of an MTHFR mutation or a combination of 
MTHFR mutations to variation in patient response to a 
therapy, one skilled in the art may use readily available 
methods to determine the appropriate parameters for a 
particular clinical trial. Examples of such factors 
include the genetic hypothesis to be tested, the number 
of subjects required to observe a statistically 
significant effect (power analysis) , the appropriate 
dosing range, the duration of the trial, and the 
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appropriate endpoints to measure the response to the 
therapy. The applications of genetic hypothesis testing, 
power analysis, and statistical analysis of patient 
response to each phase of a clinical trial have been 
described previously. Additionally, haplotyping may be 
used to detect which MTHFR genotype at which position may 
be correlated with a particular response to a particular 
therapy . 

The endpoints of a clinical trial of an anti- 
psychotic therapy may include factors associated with the 
conditions. Examples of possible endpoints include semi- 
objective grading scales administered by clinical staff. 
Several standard grading scales have been developed and 
accepted as assessment tools of the efficacy of a therapy 
in clinical trials and include, but are not limited to, 
cognitive assessment, pain, psychotic, behavioral, and 
global clinical impression scales. 

A further consideration for the design of a clinical 
trial for a compound in the category of psychiatric 
illness is the duration of the trial. It may take weeks 
to months before a "wash-out" can be achieved (i.e. 
reduction of the dose to the point of removal of therapy) 
for the therapies that the subjects had been receiving 
prior to the clinical trial. Additionally, many anti- 
psychotics exhibit dose -dependent efficacy and often 
require 2-3 weeks before empirical clinical global 
impression and semi -objective grading scale analysis 
suggests a therapeutic dose has been achieved. Both the 
"run-in" period for the achievement of a therapeutic dose 
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and the "run-out" period for the subsequent decrease in 
the dose for removal of the therapy after the clinical 
trial must be performed under controlled conditions. 

The present invention provides a number of 
advantages. For example, the methods described herein 
allow for the determination of a subject at risk for a 
psychosis for the timely administration of a prophylactic 
therapy to prevent or delay the psychosis. Thus, this 
determination of increased risk for a psychosis may occur 
before the presence of debilitating symptoms that might 
otherwise be required for a definitive diagnosis. The 
methods of the invention also have the advantage of only 
requiring the presence of one mutation in one MTHFR 
allele instead of requiring a homozygous mutation for the 
association of the mutation with a psychosis; this is a 
particular advantage in the diagnosis of schizophrenia 
which was previously correlated exclusively with 
homozygous patient genotypes. Additionally, the 
presence of mutations in MTHFR alleles may be easily and 
rapidly determined^ using standard techniques. Moreover, 
the administration of a preferred therapy to subject at 
risk for or diagnosed with from a psychosis may result in 
improved pat lent -out come, improved drug- efficacy, or 
reduced drug- toxicity compared to the administration of 
other therapies. These results may reduce healthcare, 
costs by decreasing the number of required doctor visits 
or hospitalizations. 
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The following example is provided to illustrate the 
invention. It is not meant to limit the invention in any- 
way. 

Association of the C677T MTHFR mutation with 
schizophrenia/ response to therapy, and long- term outcome 

Selection of patients 

Nonresponding schizophrenic patients (NR, n = 43) 
were selected from a list of schizophrenic patients 
identified as candidates for treatment or treated with 
atypical neuroleptics because of treatment resistance to 
conventional neuropleptics . Three institutions provided 
these NR patients: Douglas Hospital, Clinique Jeunes 
Adultes of L.H. Lafontaine Hospital, and the 
Schizophrenia Clinic of the Royal Ottawa Hospital. 
Responding patients (R, n= 62) were selected from a list 
of all patients who were considered very good responders 
to conventional neuroleptics by their treating physician 
and/or nurse and who were followed in the outpatient 
clinics attached to the Douglas and L.H. Lafontaine 
hospitals, which are the only two university-affiliated 
psychiatric hospitals for the Montreal intra-muros 
region. 

The following criteria for response or resistance to 
conventional neuroleptics were used in the selection of 
NR schizophrenic patients. None of the NR patients had 
experienced remission of psychotic symptoms within the 
past two years. In the preceding five years, all NR 
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patients had undergone at least three periods of 
treatment with conventional neuroleptics from at least 
two distinct families of drugs at a dose equal to, or 
greater than, 750 mg chlorpromazine (CPZ) equivalents for 
monotherapy or 1000 mg CPZ for therapy with a combination 
of neuroleptics. The treatments, which lasted for a 
continuous period of at least 6 weeks, resulted in no 
significant decrease in symptoms. Finally, all patients 
were unable to function without supervision in all, or 
nearly all, domains of social and vocational activities 
and had a Global Assessment Score (GAS) (Endicott et al . , 
Arch. Gen. Psychiatry 33:766-771, 1976) greater than 40 
within the last 12 months. Decision for non-response was 
established by the research psychiatrist upon review of 
medical records, with particular attention to: (i) 
unusual thought content, (ii) hallucinations, (iii) 
conceptual disorganization, (iv) motor retardation, and 
(v) emotional withdrawal (items 3,4, 12, 13 and 15 of 
the Brief Psychiatric Rating Scale; BPRS) (Woerner et 
al., Psychopharmacol . Bull. 24:112-117, 1988). At the 
time of enrollment, a minimal score of 4 (moderate to 
severe) on at least three of the five BPRS items (3, 4, 
12, 13, 15), a total BPRS score of at least 45 and/or a 
CGI (Clinical global Impression) score of at least 5 
(markedly ill) were required. These criteria for 
treatment resistance were derived from those of Kane et 
al. (Arch. Gen. Psychiatry 45 (9) : 789-96 , 1988). 

Among the selected R patients, all patients had been 
admitted at least once to a psychiatric institution 
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because of an acute psychotic episode. During each 
hospitalization, patients experienced a full or partial 
remission in response to treatment with conventional 
neuroleptics within 6 to 8 weeks of continuous treatment. 
All patients were able to function autonomously with only 
occasional supervision in all, or nearly all, domains of 
social and vocational activities. None of the R patients 
had to be admitted to hospitals because of psychotic 
exacerbation while under continuous neuroleptic 
treatment. All R patients had at least one psychotic 
relapse when neuroleptic medication was reduced or 
discontinued. Remission was defined as a complete or 
quasi -complete disappearance of schizophrenic symptoms, 
with limited residual symptoms, based on the treating 
psychiatrist clinical evaluation and hospital records. 
At the time of enrollment, total BPRS scores were less 
than 3 0 with no more than one item scoring 4 and/or a CGI 
score less than 3 (borderline mentally ill) . 

Both NR and R patients were directly interviewed 
using the Diagnostic Inteirview for Genetic Studies (DIGS) 
(Nurnberger et al . , Arch. Gen. Psychiatry 51:849-859, 
1994) , and their medical records were comprehensively 
reviewed by a research psychiatrist. Complementary 
information from the treating physician and nurses in 
charge was also obtained. The severity of the clinical 
syndrome at the time of examination was evaluated by the 
BPRS, and the diagnosis as meeting the criteria for NR or 
R patients was based on all the available data. 



The 90 control patients (C) were determined not to 
have any DSM-IV axis I mental disorders based on DIGS. 

All subjects were Caucasian with a majority (52 %) 
of French Canadians. This project was approved by the 
Research Ethics Board of each participating institution. 

Genetic analysis 

Genomic DNA was isolated from peripheral lymphocytes 
using standard methods. The target sequence was 
amplified by PGR using the published procedure of Frosst 
et al. (Nat. Genet. 10(1):111-113, 1995). PGR products 
were digested with Hinfl because the C677T mutation 
creates a Hinfl recognition sequence. The PGR product of 
198 bp was digested into fragments of 175 bp and 23 bp if 
the G677T mutation was present, while the wild- type 
(alanine) allele remained undigested (198 bp) . The 
digested PGR products were electrophoresed on 9% 
polyacrylatnide gels and photographed. Genotyping and 
scoring were performed by someone blinded to the 
diagnosis . 

Statistical analysis 

The allele frequencies in the groups of 
schizophrenia subjects and controls were compared using a 
Chi square statistic with one degree of freedom. Because 
a statistically significant difference between patients 
and controls was observed in the frequency of allele V, 
pair-wise comparisons between the three groups of 
subjects (R, NR, and C) were subsequently conducted. To 
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test for a population bias within the subjects tested, an 
analysis between the R, NR, and C patients for whom both 
parents were French Canadian and for whom at least one 
parent did not have French Canadian ancestry was also 
5 performed. 

Comparison of responding, nonresponding, and control 
groups based on frequence of the C677T MTHFR allele 
R and NR schizophrenic patients did not differ 
10 significantly according to age and gender distributions. 
Controls were significantly older and included more 
i[| females than the R and NR groups. In keeping with the 

l^f^ Study design, the two groups of schizophrenia patients 

H differed significantly according to the severity of 

15 psychosis, the percent of time spent as inpatient since 
y1 their first contact with a psychiatric institution, and 

p their age at first contact with psychiatric care 

J5j facilities (Table 8) . 

Cn Allele V was significantly more frequent in 

n 

n 20 schizophrenic patients, regardless of the quality of 
their response to neuroleptics and long-term outcome, 
compared to controls (x^ = 5.44, df = 1, p = 0.019). 
However, the most striking finding was observed when 
patients were stratified according to their quality of 
25 response to conventional neuroleptics and long-term 
outcome. The group of R patients showed a highly 
significant increase in the frequency of allele V 
(X^ = 16.77, df =: 1, p = 0.00004) in contrast to the 
group of NR patients (x^ = 0.053, df = 1, p = 0.81). The 
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odds ratio associated with carrying at least one allele V 
in the group of responder patients was 6.80 with a 95% 
confidence interval of 5.22 to 8.87. 

The same pattern of differences between responders 
and controls was observed when subjects with two French 
Canadian parents (x^ = 8.91, df = 1, p = 0.002) and 
those with at least one non-French Canadian parent ix^ 
6.71, df = 1, p = 0.009) were compared separately;. The 
similar results observed in both subpopulations compared 
to the overall tested Caucasian population reduces the 
likelihood of a population stratification bias in the 
data. The stratification of the data according to gender 
resulted in a significant difference in males (x^ = 
12.63, df = 1, p = 0.0003) and a trend in females (x^ 
2.58, df = 1, p = 0.10) . 

Both dominant ( W + AV vs . AA) and recessive ( W 
vs. AV+ AA) models of transmission of the C677T mutation 
were tested. For both models, the differences between 
neuroleptic responder patients and controls were 
significant for the whole group of patients and for the 
groups stratified according to background of parents 

(Table 9) . Finally, an ANOVA analysis was performed in 
which the genotype was the grouping variable and the age- 
at-onset was the outcome variable. A tendency toward a 
dose effect of allele V was observed for the age-at-onset 

(F = 2.48, 
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df = {2, 94); p = 0.088). The average ages at onset ± 
(SD) were 19.16 (4.49), 20.91 (5.42) and 22.52 (5.53) for 
patients with genotypes AA, AV, and W respectively. 



Other Embodiments 

While the invention has been described in con- 
nection with specific embodiments thereof, it will be 
understood that it is capable of further modifications 
and this application is intended to cover any variations, 
uses, or adaptations of the invention following, in 
general, the principles of the invention and including 
such departures from the present disclosure as come 
within known or customary practice within the art to 
which the invention pertains and as may be applied to the 
essential features hereinbefore set forth, and as follows 
in the scope of the appended claims. 



